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NOVEL POLYNUCLEOTIDES AND METHOD OF USE THEREOF 



FIELD OF THE INVENTION 



The present invention relates generally to the identification and isolation of novel nucleic 



acid molecules which constitute at least a portion of full-length cDNA molecules that encode 
5 human polypeptides. 



Extracellular proteins play important roles in, among other things, the formation, 
differentiation and maintenance of multicellular organisms. The fate of many individual cells, 

1 0 e.g., proliferation, migration, differentiation, or interaction with other cells, is typically governed 
by information received from other cells and/or the immediate environment. This information 
is often transmitted by secreted polypeptides (for instance, mitogenic factors, survival factors, 
cytotoxic factors, differentiation factors, neuropeptides, and hormones) which are, in turn, 
received and interpreted by diverse cell receptors or membrane-bound proteins. These secreted 

15 polypeptides or signaling molecules normally pass through the cellular secretory pathway to 
reach their site of action in the extracellular environment. 

Secreted proteins have various industrial applications, including as pharmaceuticals, 
diagnostics, biosensors and bioreactors. Most protein drugs available at present, such as 
thrombolytic agents, interferons, interleukins, erythropoietins, colony stimulating factors, and 

20 various other cytokines, are secretory proteins. Their receptors, which are membrane proteins, 
also have potential as therapeutic or diagnostic agents. Efforts are being undertaken by both 
industry and academia to identify new, native secreted proteins. Many efforts are focused on the 
screening of mammalian recombinant DNA libraries to identify the coding sequences for novel 
secreted proteins. Examples of screening methods and techniques are described in the literature 



25 | see, for example, Klein et al., Proc. Natl. Acad. Sci. , 93:7108-71 13 (1996); U.S. Patent No. 
5,536,637)]. 



Membrane-bound proteins and receptors can play important roles in, among other things, 
the formation, differentiation and maintenance of multicellular organisms. The fate of many 
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individual cells, e.g., proliferation, migration, differentiation, or interaction with other cells, is 
typically governed by information received from other cells and/or the immediate environment. 
This information is often transmitted by secreted polypeptides (for instance, mitogenic factors, 
survival factors, cytotoxic factors, differentiation factors, neuropeptides, and hormones) which 
are, in turn, received and interpreted by diverse cell receptors or membrane-bound proteins. Such 
5 membrane-bound proteins and cell receptors include, but are not limited to, cytokine receptors, 
receptor kinases, receptor phosphatases, receptors involved in cell-cell interactions, and cellular 
adhesin molecules like selectins and integrins. For instance, transduction of signals that regulate 
cell growth and differentiation is regulated in part by phosphorylation of variouscellular proteins. 
Protein tyrosine kinases, enzymes that catalyze that process, can also act as growth factor 

1 0 receptors. Examples include fibroblast growth factor receptor and nerve growth factor receptor. 

Membrane-bound proteins and receptor molecules have various industrial applications, 
including as pharmaceutical and diagnostic agents. Receptor immunoadhesins, for instance, can 
be employed as therapeutic agents to block receptor-ligand interactions. The membrane-bound 
proteins can also be employed for screening of potential peptide or small molecule inhibitors of 

15 the relevant receptor/ligand interaction. Efforts are being undertaken by both industry and 
academia to identify new, native receptor or membrane-bound proteins. Many efforts are focused 
on the screening of mammalian recombinant DNA libraries to identify the coding sequences for 
novel receptor or membrane-bound proteins. 

Recently, significant progress has been made in identifying and isolating unique nucleic 

20 acid moelculcs which encode all or a portion of many mammalian proteins. We herein describe 
the identification and characterization of novel polynucleotides which constitute at least partial 
cDNA molecules that encode various human polypeptides. 

SUMMARY OF THE INVENTION 
25 Novel polynucleotides have been identified and isolated which constitute at least partial 

cDNA molecules that encode human polypeptides. 

In one embodiment, the invention provides an isolated nucleic acid molecule comprising 
any one of the nucleic acid sequences shown in the accompanying figures, or the complement 
thereof, or polynucleotide variants of those nucleic acid sequences as defined below. 
30 In another embodiment, the invention provides an isolated nucleic acid molecule 

consisting essentially of any one of the nucleic acid sequences shown in the accompanying 
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figures, or the complement thereof , or polynucleotide variants ol those nucleic acid sequences 
as defined helow. 

In another embodiment, the invention provides an isolated nucleic acid molecule 
consisting of any one of the nucleic acid sequences shown in the accompanying figures, or the 
complement thereof, or polynucleotide valiants of those nucleic acid sequences as defined below. 
5 In yet another embodiment, the invention provides an isolated nucleic acid molecule that 

comprises a nucleotide sequence having at least about 80% sequence identity, preferably at least 
about 81% sequence identity, more preferabl> at least about 82% sequence identity, yet more 
preferably at least about 83' h sequence identity, yet more preferably at least about 84' c sequence 
identity, yet more preferably at least about 85% sequence identity, yet more preferably at least 

1 0 about 86* ^ sequence identity, yet more preferably at least about 87 % sequence identity, yet more 
preferabh at least about 88' h sequence identity, yet more preferably at least about 89' b sequence 
identity, yet more preferably at least about 90% sequence identity, yet more preferably at least 
about 91% sequence identity, yet more preferably at least about 92' * sequence identity, yet more 
preferably at least about 93' < sequence identity, yet more preferably at least about 94' c sequence 

15 identity, set more preferably at least about 95%' sequence identity, yet more preferably at least 
about 96' V) sequence identity, yet more preferably at least about 97' h sequence identity, yet more 
preferabh at least about 9S% sequence identity and yet more prelerably at least about 99% 
sequence ident ity to (a) the I)N A molecule of any one of Figure I to 562, or (b) the complement 
of the DN A molecule of (a ). 

20 

In another aspect, the isolated nucleic acid molecule consists essentially of a nucleotide 
sequence having at least about 80% sequence identity, preferably at least about 81%> sequence 
identity, more preferably at least about 82% sequence identity, yet more preferably at least about 
83%> sequence identity, yet more preferably at least about 84% sequence identity, yet more 

25 preferably at least about 85' sequence identity, yet more preferably at least about 86% sequence 
identity, yet more preferably at least about 87% sequence identity, yet more preferably at least 
about 88% sequence identity, yet more preferably at least about 89% sequence identity, yet more 
preferably at least about 90' r. sequence identity, yet more preferabh at least about 91 % sequence 
identity, \et more preferably at least about 92% sequence identity, yet more preferably at least 

30 about 93% sequence identity, yet more preferably at least about 94% sequence identity, yet more 
preferably at least about 95%' sequence identity, yet more preferably at least about 96% sequence 
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identity, yet more preferably at least about 97% sequence identity, yet moie preferably at least 
about 98%< sequence identity and yet more preferably at least about 99% sequence identity to (a) 
the DNA molecule of any one of Figure 1 to 562. or (b) the complement ol the DNA molecule 
of (a). 

In yet another aspect, the isolated nucleic acid molecule consists of a nucleotide sequence 
5 having at least about 80% sequence identity, preferably at least about 81% sequence identity, 
more preferably at least about 82% sequence identity, yet more preferably at least about 83% 
sequence identity, yet more preferably at least about 84%> sequence identity, yet more preferably 
at least about 85% sequence identity, yet more preferably at least about 86% sequence identity, 
yet more preferably at least about 87% sequence identity, yet more preferably at least about 88' b 

1 0 sequence identity, yet more preferably at least about 89* b sequence identity, yet more preferably 
at least about 90% sequence identity, yet more preferably at least about 91 % sequence identity, 
yet more preferably at least about 92% sequence identity, yet more preferably at least about 93% 
sequence identity, yet more preferably at least about 94 %> sequence identity, yet more preferably 
at least about 95% sequence identity, yet more preferably at least about 96 ( . b sequence identity, 

1 5 yet more preferably at least about 97% sequence identity , yet more preferably at least about 98 f b 
sequence identity and yet more preferably at least about 99% sequence identity to (a) the DNA 
molecule of any one of Figure 1 to 562, or (b) the complement of the DNA molecule of (a). 

In another embodiment, the invention concerns an isolated nucleic acid molecule which 
comprises a nucleotide sequence that hybridizes to (a) the DNA molecule of any one of Figure 

20 1 to 562, or (b) the complement of the DNA molecule of (a). Preferably, hybridization occurs 
under stringent hybridization and wash conditions. Also, it is preferred that the isolated nucleic 
acid molecule is lully complementary to (a) the DNA molecule of any one of Figure 1 to 562, or 
(b) the complement of the DNA molecule of (a). 

In yet another embodiment, the present invention provides an isolated nucleic acid 

25 molecule which comprises at least about 10 consecutive nucleotides contained within (a) the 
DNA molecule of any one of Figure 1 to 562, or (b) the complement of the DNA molecule of (a) 
which may find use as, for example, hybridizing oligonucleotide probes or for encoding 
polypeptide fragments that may optionally comprise a binding site for an antibody. In particular 
aspects, the isolated nucleic acid molecule is from about 10 to about 1 000, about 1 0 to about 900, 

30 about 10 to about 800, about 10 to about 700, about 10 to about 600, about 10 to about 500, 
about 10 to about 400, about 10 to about 300, about 10 to about 200, about 10 to about 100, 
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about 10 to about 90, about 10 to about 80. about 10 to about 70, about 10 to about 60, about 10 
to about 50, about 10 to about 40, about 10 to about 30 or about 10 to about 20 nucleotides in 
length, where the term "about" means the referenced nucleotide sequence length plus or minus 
lO'/r of that referenced length. In yet other aspects, the isolated nucleic acid molecule comprises 
at least about 15, 20, 25, 30, 35, 40, 45, 50, 55. 60, 65, 70. 75, 80, 85. 90, 95 or 100 consecutive 
5 nucleotides contained within (a) the DNA molecule of any one of Figure 1 to 562, or (b) the 
complement of the DNA molecule of (a). 

The present invention is also directed to a method of using an oligonucleotide probe 
having a nucleotide sequence derived from a nucleic acid molecule described herein for detecting 
the presence of and/or obtaining a full-length mammalian cDNA molecule from a mammalian 
1 0 cDNA library which encodes a mammalian polypeptide. Preferably, the mammal is human. The 
methods comprise the step of screening a mammalian cDNA library with one or more of the 
herein described oligonucleotides to detect the presence of a full-length cDNA and, optionally, 
obtaining the full-length cDNA from that library. 

In another embodiment, the invention provides a vector comprising any of the isolated 
1 5 nucleic acid molecules described herein or their variants. 

A host cell comprising such a vector is also provided. By way of example, the host cells 
may be CHO cells, E. coli, or yeast. A process for producing polypeptides is further provided 
and comprises culturing the host cells under conditions suitable for expression of a polypeptide 
and recovering the polypeptide from the cell culture. 
20 In another embodiment, the invention provides isolated polypeptides encoded by any of 

the isolated nucleic acids described herein, wherein thise polypeptides are herein designated as 
SRT polypeptides. 

In yet another embodiment, the invention provides antibodies which specifically bind to 
a polypeptide encoded by a nucleic acid molecule described herein. Preferably, the antibodies 
25 are monoclonal antibodies. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure I shows a nucleotide sequence (SEQ ID NO: 1 ) designated herein as DNA8284. 
Figure 2 shows a nucleotide sequence (SEQ ID NO:2) designated herein as DNA8328. 
30 Figure 3 shows a nucleotide sequence (SEQ ID NO:3) designated herein as DNA8350. 

Figure 4 shows a nucleotide sequence (SEQ ID NO:4) designated herein as DNA8369. 
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Figure 5 shows a nucleotide sequence (SEQ ID NO. 5) designated herein as DNA8377. 
Figure 6 shows a nucleotide sequence (SEQ ID NO 6) designated herein as DNA8456. 
Figure 7 shows a nucleotide sequence (SEQ ID NO 7) designated herein as DNA8555. 
Figure 8 shows a nucleotide sequence (SEQ ID NO 8) designated herein as DNA8576. 
Figure 9 shows a nucleotide sequence (SEQ ID NO:9) designated herein as DNA9383. 
5 Figure 10 shows a nucleotide sequence (SEQ ID NO: 10) designated herein as DNA9840. 

Figure 11 shows a nucleotide sequence (SEQ ID NO: 1 1 ) designated herein as 
DNA 10028. 

Figure 12 shows a nucleotide sequence (SEQ ID NO: 12) designated herein as 
DNA 10072. 

10 Figure 13 shows a nucleotide sequence (SEQ ID NO: 13) designated herein as 

DNA 10242. 

Figure 14 shows a nucleotide sequence (SEQ ID NO: 14) designated herein as 
DNA10281. 

Figure 15 shows a nucleotide sequence (SEQ ID NO: 15) designated herein as 
15 DNA 1 2628. 

Figure 16 shows a nucleotide sequence (SEQ ID NO: 16) designated herein as 
DNA 1 2646. 

Figure 17 shows a nucleotide sequence (SEQ ID NO: 17) designated herein as 
DNA 1 2655. 

20 Figure 18 shows a nucleotide sequence (SEQ ID NO: 18) designated herein as 

DNA 12660. 

Figure 19 shows a nucleotide sequence (SEQ ID NO: 19) designated herein as 
DNA 12668. 

Figure 20 shows a nucleotide sequence (SEQ ID NO:20) designated herein as 
25 DNA 12726. 

Figure 21 shows a nucleotide sequence (SEQ ID NO:21) designated herein as 
DNA 12728. 

Figure 22 shows a nucleotide sequence (SEQ ID NO:22) designated herein as 
DNA 12729. 

30 Figure 23 shows a nucleotide sequence (SEQ ID NO:23) designated herein as 

DNA 12732. 
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Figure 24 shows a nucleotide 
DNAI2733. 

Figure 25 shows a nucleotide 
DNA1274I. 

Figure 26 shows a nucleotide 
5 DNA12742. 

Figure 27 shows a nucleotide 
DNA 12747. 

Figure 28 shows a nucleotide 
DNA12752. 
10 Figure 29 shows a nucleotide 

DNA 12797. 

Figure 30 shows a nucleotide 
DNA 12801. 

Figure 31 shows a nucleotide 
15 DNA 12802. 

Figure 32 shows a nucleotide 
DNA 128 17. 

Figure 33 shows a nucleotide 
DNA12819. 
20 Figure 34 shows a nucleotide 

DNA 12829. 

Figure 35 shows a nucleotide 
DNA I 2830. 

Figure 36 shows a nucleotide 
25 DNA 12834. 

Figure 37 shows a nucleotide 
DNA 12837. 

Figure 38 shows a nucleotide 
DNA 12840. 
30 Figure 39 shows a nucleotide 

DNA 12841. 




sequence (SEQ ID NO 24) designated herein as 

sequence (SEQ ID NO 25) designated herein as 

sequence (SEQ ID NO 26) designated herein as 

sequence (SEQ ID NO. 27) designated herein as 

sequence (SEQ ID NO 28) designated herein as 

sequence (SEQ ID NO. 29) designated herein as 

sequence (SEQ ID NO:30) designated herein as 

sequence (SEQ ID NO:31) designated herein as 

sequence (SEQ ID NO: 32) designated herein as 

sequence (SEQ ID NO:33) designated herein as 

sequence (SEQ ID NO:34) designated herein as 

sequence (SEQ ID NO: 35) designated herein as 

sequence (SEQ ID NO:36) designated herein as 

sequence (SEQ ID NO: 37) designated herein as 

sequence (SEQ ID NO:38) designated herein as 

sequence (SEQ ID NO:39) designated herein as 




Figure 40 shows a nucleotide 
DN A 12844. 

Figure 41 shows a nucleotide 
DNAI2846. 

Figure 42 shows a nucleotide 
5 DN A 12850. 

Figure 43 shows a nucleotide 
DNA 12865. 

Figure 44 shows a nucleotide 
DNA 12867. 
10 Figure 45 shows a nucleotide 

DNA12884. 

Figure 46 shows a nucleotide 
DNA 1 2889. 

Figure 47 shows a nucleotide 
15 DNA 12891. 

Figure 48 shows a nucleotide 
DNA 12900. 

Figure 49 shows a nucleotide 
DNA 1 2922. 
20 Figure 50 shows a nucleotide 

DNA 12946. 

Figure 51 shows a nucleotide 
DNA12 C >67. 

Figure 52 shows a nucleotide 
25 DNA 12974. 

Figure 53 shows a nucleotide 
DNA 12982. 

Figure 54 shows a nucleotide 
DNA 12983. 
30 Figure 55 shows a nucleotide 

DNA 1 299 1 . 




sequence (SHQ ID NO:40) designated herein as 

sequence (SEQ ID NO:41) designated herein as 

sequence (SEQ ID NO:42) designated herein as 

sequence (SHQ ID NO:43) designated herein as 

sequence (SEQ ID NO:44) designated herein as 

sequence (SEQ ID NO:45) designated herein as 

sequence (SEQ ID NO:46) designated herein as 

sequence (SEQ ID NO:47) designated herein as 

sequence (SEQ ID NO:48) designated herein as 

sequence (SEQ ID NO:49) designated herein as 

sequence (SEQ ID NO: 50) designated herein as 

sequence (SEQ ID NO:5I) designated herein as 

sequence (SEQ ID NO:52) designated herein as 

sequence (SEQ ID NO:53) designated herein as 

sequence (SEQ ID NO:54) designated herein as 

sequence (SEQ ID NO:55) designated herein as 
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Figure 56 shows a nucleotide sequence (SEQ ID NO:56) designuted herein as 
DNA 1 2998. 

Figure 57 shows a nucleotide sequence (SEQ ID NO:57) designated herein as 
DNA 12999. 

Figure 58 shows a nucleotide sequence (SEQ ID NO: 58) designated herein as 
5 DNA13101. 

Figure 59 shows a nucleotide sequence (SEQ ID NO:59) designated herein as 
DNA 13 104. 

Figure 60 shows a nucleotide sequence (SEQ ID NO:60) designated herein as 
DNA 1 3 1 10. 

10 Figure 61 shows a nucleotide sequence (SEQ ID NO:ol) designated herein as 

DNA131 14. 

Figure 62 shows a nucleotide sequence (SEQ ID NO:62) designated herein as 
DNA131 15. 

Figure 63 shows a nucleotide sequence (SEQ ID NO:03) designated herein as 
15 DNA13116. 

Figure 04 shows a nucleotide sequence (SEQ ID NO:64) designated herein as 
DNA 13 I 18. 

Figure 65 shows a nucleotide sequence (SEQ ID NO:65) designated herein as 
DNA 13 124. 

20 Figure 66 shows a nucleotide sequence (SEQ ID NO:66) designated herein as 

DNA 13 132. 

Figure 67 shows a nucleotide sequence (SEQ ID NO:67) designated herein as 
DNA13I33. 

Figure 68 shows a nucleotide sequence (SEQ ID NO:68) designated herein as 
25 DNA13146. 

Figure 69 shows a nucleotide sequence (SEQ ID NO:69) designated herein as 
DNAI3152. 

Figure 70 shows a nucleotide sequence (SEQ ID NO:70) designated herein as 
DNA13156. 

30 Figure 71 shows a nucleotide sequence (SEQ ID NO:71) designated herein as 

DNA 13 163. 




Figure 72 shows a nucleotide 
DNA 13185. 

Figure 73 shows a nucleotide 
DNA 1 3992. 

Figure 74 shows a nucleotide 
5 DN A 14523. 

Figure 75 shows a nucleotide 
DN A 14656. 

Figure 76 shows a nucleotide 
DN A 14938. 
10 Figure 77 shows a nucleotide 

DNA15I72. 

Figure 78 shows a nucleotide 
DNA15618. 

Figure 79 shows a nucleotide 
15 DNA 16546. 

Figure 80 shows a nucleotide 
DNAI6669. 

Figure 81 shows a nucleotide 
DNA 17244. 
20 Figure 82 shows a nucleotide 

DNA 18382. 

Figure 83 shows a nucleotide 
DNA 18444. 

Figure 84 shows a nucleotide 
25 DNA 18649. 

Figure 85 shows a nucleotide 
DNA 19597. 

Figure 86 shows a nucleotide 
DNA 19601. 
30 Figure 87 shows a nucleotide 

DNA21386. 




sequence (SEQ ID NO: 72) designated herein as 

sequence (SEQ ID NO:73) designated herein as 

sequence (SEQ ID NO: 74) designated herein as 

sequence (SEQ ID NO:75) designated herein as 

sequence (SEQ ID NO:76) designated herein as 

sequence (SEQ ID NO:77) designated herein as 

sequence (SEQ ID NO:78) designated herein as 

sequence (SEQ ID NO:79) designated herein as 

sequence (SEQ ID NO:80) designated herein as 

sequence (SEQ ID NO:81) designated herein as 

sequence (SEQ ID NO:82) designated herein as 

sequence (SEQ ID NO:83) designated herein as 

sequence (SEQ ID NO:84) designated herein as 

sequence (SEQ ID NO:85) designated herein as 

sequence (SEQ ID NO:86) designated herein as 

sequence (SEQ ID NO:87) designated herein as 
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Figure 88 shows a nucleotide 
DNA22868. 

Figure 89 shows a nucleotide 
DNA23694. 

Figure C H) shows a nucleotide 
5 DNA240.S0. 

Figure 91 shows a nucleotide 
DNA24074. 

Figure 92 shows a nucleotide 
DNA24787. 
10 Figure c >3 shows a nucleotide 

DNA28242. 

Figure 94 shows a nucleotide 
DNA28254. 

Figure l >5 shows a nucleotide 
15 DNA31751. 

Figure 96 shows a nucleotide 
DNA32922. 

Figure 97 shows a nucleotide 
DNA33439. 
20 Figure 98 shows a nucleotide 

DNA34508. 

Figure 99 shows a nucleotide 
DNA34807. 

Figure 100 shows a nucleotide 
25 DNA34832. 

Figure 101 shows a nucleotide 
DNA36223. 

Figure 102 shows a nucleotide 
DNA36240. 
30 Figure 103 shows a nucleotide 

DNA36490. 




sequence (SEQ ID NO: 88) designated herein as 

sequence (SEQ) ID NO: 89) designated herein as 

sequence (SEQ) ID NO:90) designated herein as 

sequence (SEQ ID NO:9l) designated herein as 

sequence (SEQ ID NO:92) designated herein as 

sequence (SEQ ID NO: ( >3) designated herein as 

sequence (SEQ) ID NO: l »4) designated herein as 

sequence (SEQ ID NO:<>5) designated herein as 

sequence (SEQ ID NO:<>6) designated herein as 

sequence (SEQ ID NO:97) designated herein as 

sequence (SEQ ID NO:98) designated herein as 

sequence (SEQ ID NO:99) designated herein as 

sequence (SEQ ID NO: 100) designated herein as 

sequence (SEQ ID NO: 101) designated herein as 

sequence (SEQ ID NO: 102) designated herein as 

sequence (SEQ ID NO: 103) designated herein as 



Figure 104 shows a nucleotide sequence (SEQ ID NO: 104) designated herein as 
DNA36516. 

Figure 105 shows a nucleotide sequence (SEQ ID NO: 1 05 ) designated herein as 
DNA36533. 

Figure 106 shows a nucleotide sequence (SEQ ID NO: 106) designated herein as 
5 DNA36538. 

Figure 107 shows a nucleotide sequence (SEQ ID NO: 107) designated herein as 
DNA36788. 

Figure 108 shows a nucleotide sequence (SEQ ID NO: 108) designated herein as 
DNA36818. 

10 Figure 109 shows a nucleotide sequence (SEQ ID NO: 1 09) designated herein as 

DNA36868. 

Figure 110 shows a nucleotide sequence (SEQ ID NO: 110) designated herein as 
DNA37393. 

Figure 111 shows a nucleotide sequence (SEQ ID NO: Ml) designated herein as 
15 DNA27588. 

Figure I 12 shows a nucleotide sequence (SEQ ID NO: 112) designated herein as 
DNA37602. 

Figure 113 shows a nucleotide sequence (SEQ ID NO: 113) designated herein as 
DNA37642. 

20 Figure 114 shows a nucleotide sequence (SEQ ID NO: 114) designated herein as 

DNA37676. 

Figure 115 shows a nucleotide sequence (SEQ ID NO: 115) designated herein as 
DNA37721. 

Figure 116 shows a nucleotide sequence (SEQ ID NO: I 16) designated herein as 
25 DNA37759. 

Figure 117 shows a nucleotide sequence (SEQ ID NO: 117) designated herein as 
DNA37857. 

Figure 118 shows a nucleotide sequence (SEQ ID NO: 118) designated herein as 
DNA37937. 

30 Figure 119 shows a nucleotide sequence (SEQ ID NO: 119) designated herein as 

DNA38037. 
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Figure 120 shows a nucleotide sequence (SEQ ID NO: 120) designated herein as 
DNA38050. 

Figure 121 shows a nucleotide sequence (SFQ ID NO: 121) designated herein as 
DNA38053. 

Figure 122 shows a nucleotide sequence {SFQ ID NO: 1 22) designated herein as 
5 DNA38312. 

Figure 123 shows a nucleotide sequence (SFQ ID NO: 123) designated herein as 
DNA38360. 

Figure 124 shows a nucleotide sequence (SEQ ID NO: 124) designated herein as 
DNA38600. 

10 Figure 1 25 shows a nucleotide sequence (SEQ ID NO: 125) designated herein as 

DNA38720. 

Figure 126 shows a nucleotide sequence (SEQ ID NO: 126) designated herein as 
DNA38727. 

Figure 127 shows a nucleotide sequence (SEQ ID NO: 1 27) designated herein as 
15 DNA38731. 

Figure 1 28 shows a nucleotide sequence (SEQ ID NO: 1 28) designated herein as 
DNA38810. 

Figure 129 shows a nucleotide sequence (SEQ ID NO: 129) designated herein as 
DNA38814. 

20 Figure 130 shows a nucleotide sequence (SEQ ID NO: 130) designated herein as 

DNA39378. 

Figure 131 shows a nucleotide sequence (SEQ ID NO: 131) designated herein as 
DNA40050. 

Figure 132 shows a nucleotide sequence (SEQ ID NO: 132) designated herein as 
25 DNA40375. 

Figure 133 shows a nucleotide sequence (SEQ ID NO: 133) designated herein as 
DNA40382. 

Figure 134 shows a nucleotide sequence (SEQ ID NO: 134) designated herein as 
DNA40394. 

30 Figure 135 shows a nucleotide sequence (SEQ ID NO: 135) designated herein as 

DNA40461. 

13 





Figure 


136 


4 

shows 


a 


nucleotide 


sequence 


(SFQ 


• 

ID NO:l36) designated 


herein 


as 




DNA40735. 






















Figure 


137 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID NO: 1 37) designated 


herein 


as 




DNA40736. 






















Figure 


138 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID NO: 1 38) designated 


herein 


as 


5 


DNA40738. 






















Figure 


139 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID NO: 1 39) designated 


herein 


as 




DNA40739. 






















Figure 


140 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID NO:l40) designated 


herein 


as 




DNA41 144. 




















10 


Figure 


141 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID NO:l41) designated 


herein 


as 




DNA41 161. 






















Figure 


142 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID NO: 142) designated 


herein 


as 




DNA41 186. 






















Figure 


143 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID NO: 143) designated 


herein 


as 


15 


DNA41250. 






















Figure 


144 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID NO: 144) designated 


herein 


as 




DNA41284. 






















Figure 


145 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID NO: 145) designated 


herein 


as 




DNA41303. 




















20 


Figure 


146 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID NO: 146) designated 


herein 


as 




DNA41326. 






















Figure 


147 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID NO: 147) designated 


herein 


as 




DNA4I444. 






















Figure 


148 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID NO: 148) designated 


herein 


as 


25 


DNA41445. 






















Figure 


149 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID NO: 149) designated 


herein 


as 




DNA41452. 






















Figure 


150 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID NO: 150) designated 


herein 


as 




DNA41456. 




















30 


Figure 


151 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID NO: 151) designated 


herein 


as 




DNA41458. 
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Figure 152 shows a nucleotide sequence (SEQ ID NO: 152) designated herein as 
DNA41462. 

Figure 153 shows a nucleotide sequence (SEQ ID NO: 153) designated herein as 
DNA414<>5. 

Figure 154 shows a nucleotide sequence (SEQ ID NO: 154) designated herein as 
5 DNA41475. 

Figure 155 shows a nucleotide sequence (SEQ ID NO: 155) designated herein as 
DNA415I4. 

Figure 156 shows a nucleotide sequence (SEQ ID NO: 156) designated herein as 
DNA415o5. 

10 Figure 157 shows a nucleotide sequence (SEQ ID NO: 157) designated herein as 

DNA41566. 

Figure 158 shows a nucleotide sequence (SEQ ID NO: 158) designated herein as 
DNA4162(). 

Figure 159 shows a nucleotide sequence (SEQ ID NO: 159) designated herein as 
15 DNA4I709. 

Figure 160 shows a nucleotide sequence (SEQ ID NO: 160) designated herein as 
DNA41775. 

Figure 161 shows a nucleotide sequence (SEQ ID NO: 161) designated herein as 
DNA41784. 

20 Figure 162 shows a nucleotide sequence (SEQ ID NO: 162) designated herein as 

DNA42194. 

Figure 163 shows a nucleotide sequence (SEQ ID NO: 163) designated herein as 
DNA42279. 

Figure 164 shows a nucleotide sequence (SEQ ID NO: 164) designated herein as 
25 DNA423I4. 

Figure 165 shows a nucleotide sequence (SEQ ID NO: 165) designated herein as 
DNA42331. 

Figure 166 shows a nucleotide sequence (SEQ ID NO: 166) designated herein as 
DNA42358. 

30 Figure 167 shows a nucleotide sequence (SEQ ID NO: 167) designated herein as 

DNA42858. 
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Figure 168 shows a nucleotide sequence (SEQ ID NO: 168) designated herein as 
DNA42870. 

Figure 169 shows a nucleotide sequence (SF^Q ID NO: 169) designated herein as 
DNA42875. 

Figure 170 shows a nucleotide sequence (SEQ ID NO: 170) designated herein as 
5 DNA43197. 

Figure 171 shows a nucleotide sequence (SEQ ID NO: 171) designated herein as 
DNA43203. 

Figure 172 shows a nucleotide sequence (SEQ ID NO: 172) designated herein as 
DNA43295. 

10 Figure 1 73 shows a nucleotide sequence (SEQ ID NO: 173) designated herein as 

DNA43301. 

Figure 174 shows a nucleotide sequence (SEQ ID NO: 174) designated herein as 
DNA43363. 

Figure 175 shows a nucleotide sequence (SEQ ID NO: 175) designated herein as 
1 5 DNA43420. 

Figure 176 shows a nucleotide sequence (SEQ ID NO: 176) designated herein as 
DNA443479. 

Figure 177 shows a nucleotide sequence (SEQ ID NO: 177) designated herein as 
DNA43489. 

20 Figure 178 shows a nucleotide sequence (SEQ ID NO: 178) designated herein as 

DNA43498. 

Figure 179 shows a nucleotide sequence (SEQ ID NO: 179) designated herein as 
DNA43509. 

Figure 180 shows a nucleotide sequence (SEQ ID NO: 180) designated herein as 
25 DNA435I2. 

Figure 181 shows a nucleotide sequence (SEQ ID NO: 181) designated herein as 
DNA4353I. 

Figure 182 shows a nucleotide sequence (SEQ ID NO: 182) designated herein as 
DNA43546. 

30 Figure 183 shows a nucleotide sequence (SEQ ID NO: 183) designated herein as 

DNA43586. 

16 




Figure 184 shows a nucleotide sequence (SEQ ID NO: 184) designated herein as 
DNA43862. 

Figure 185 shows a nucleotide sequence (SHQ ID NO: 185) designated herein as 
DNA43887. 

Figure 186 shows a nucleotide sequence (SEQ ID NO: 18b) designated herein as 
5 DNA43936. 

Figure 187 shows a nucleotide sequence (SEQ ID NO: 187) designated herein as 
DNA43961. 

Figure 188 shows a nucleotide sequence (SEQ ID NO: 188) designated herein as 
DNA43971. 

10 F igure 189 shows a nucleotide sequence (SEQ ID NO: 1 89) designated herein as 

DNA44048. 

Figure 190 shows a nucleotide sequence (SEQ ID NO: 190) designated herein as 
DNA44920. 

Figure 191 shows a nucleotide sequence (SEQ ID NO: 191) designated herein as 
15 DNA44922. 

Figure 192 shows a nucleotide sequence (SEQ ID NO: 192) designated herein as 
DNA44934. 

Figure 193 shows a nucleotide sequence (SEQ ID NO: 193) designated herein as 
DNA44987. 

20 Figure 194 shows a nucleotide sequence (SEQ ID NO: 194) designated herein as 

DNA45014. 

Figure 195 shows a nucleotide sequence (SEQ ID NO 195) designated herein as 
DNA45030. 

Figure 196 shows a nucleotide sequence (SEQ ID NO: 196) designated herein as 
25 DNA45051. 

Figure 197 shows a nucleotide sequence (SEQ ID NO: 197) designated herein as 
DNA45064. 

Figure 198 shows a nucleotide sequence (SEQ ID NO: 198) designated herein as 
DNA45282. 

30 Figure 199 shows a nucleotide sequence (SEQ ID NO: 199) designated herein as 

DNA45288. 
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Figure 200 shows u nucleotide 
DN A45300. 

Figure 201 shows a nucleotide 
DNA45740. 

Figure 202 shows a nucleotide 
5 DNA45759. 

Figure 203 shows a nucleotide 
DNA45784. 

Figure 204 shows a nucleotide 
DNA45789. 
10 Figure 205 shows a nucleotide 

DNA45816. 

Figure 206 shows a nucleotide 
DNA45944. 

Figure 207 shows a nucleotide 
15 DNA45954. 

Figure 208 shows a nucleotide 
DNA45964. 

Figure 209 shows a nucleotide 
DNA45993. 
20 Figure 210 shows a nucleotide 

DNA46092. 

Figure 211 shows a nucleotide 
DNA46213. 

Figure 212 shows a nucleotide 
25 DNA46215. 

Figure 213 shows a nucleotide 
DNA46226. 

Figure 214 shows a nucleotide 
DNA46328. 
30 Figure 215 shows a nucleotide 

DNA47580. 




sequence (SFQ ID NO:200) designated herein as 

sequence (SFQ ID NO:201) designated herein as 

sequence (SEQ ID NO:202) designated herein as 

sequence (SEQ ID NO:203) designated herein as 

sequence (SEQ ID NO:204) designated herein as 

sequence (SEQ ID NO: 205) designated herein as 

sequence (SEQ) ID NO:206) designated herein as 

sequence (SEQ ID NO:207) designated herein as 

sequence (SEQ ID NO:208) designated herein as 

sequence (SEQ ID NO:209) designated herein as 

sequence (SEQ ID NO:210) designated herein as 

sequence (SEQ ID NO:211) designated herein as 

sequence (SEQ ID NO:212) designated herein as 

sequence (SEQ ID NO:213) designated herein as 

sequence (SEQ ID NO:214) designated herein as 

sequence (SEQ ID NO:2I5) designated herein as 
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Figure 216 shows a nucleotide sequence (SEQ ID NO:216) designated herein as 
DNA47691. 

Figure 217 shows a nucleotide sequence (SFQ ID NO:2I7) designated herein as 
DNA47751. 

Figure 218 shows a nucleotide sequence (SEQ) ID NO:218) designated herein as 
5 DNA47835. 

Figure 219 shows a nucleotide sequence (SFQ ID NO:219) designated herein as 
DNA47858. 

Figure 220 shows a nucleotide sequence (SEQ) ID NO:220) designated herein as 
DNA47890. 

10 Figure 221 shows a nucleotide sequence (SEQ ID NO:221) designated herein as 
DNA47930. 

Figure 222 shows a nucleotide sequence (SEQ ID NO:222) designated herein as 
DNA47990. 

Figure 223 shows a nucleotide sequence (SEQ ID NO:223) designated herein as 
15 DNA48054. 

Figure 224 shows a nucleotide sequence (SEQ ID NO:224) designated herein as 
DNA48124. 

Figure 225 shows a nucleotide sequence (SEQ ID NO:225) designated herein as 
DNA48131. 

20 Figure 226 shows a nucleotide sequence (SEQ ID NO:226) designated herein as 
DNA48162. 

Figure 227 shows a nucleotide sequence (SEQ) ID NO:227) designated herein as 
DNA48209. 

Figure 228 shows a nucleotide sequence (SEQ ID NO:228) designated herein as 
25 DNA48389. 

Figure 229 shows a nucleotide sequence (SEQ ID NO: 229) designated herein as 
DNA48446. 

Figure 230 shows a nucleotide sequence (SEQ ID NO:230) designated herein as 
DNA48466. 

30 Figure 231 shows a nucleotide sequence (SEQ ID NO:23I) designated herein as 
DNA48576. 
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Figure 232 shows a nucleotide 
DNA48598. 

Figure 233 shows a nucleotide 
DNA48666. 

Figure 234 shows a nucleotide 
5 DNA48748. 

Figure 235 shows a nucleotide 
DNA48777. 

Figure 236 shows a nucleotide 
DNA48830. 
10 Figure 237 shows a nucleotide 

DNA49352. 

Figure 238 shows a nucleotide 
DNA49407. 

Figure 239 shows a nucleotide 
15 DNA49448. 

Figure 240 shows a nucleotide 
DNA49528. 

Figure 241 shows a nucleotide 
DNA4<»529. 
20 Figure 242 shows a nucleotide 

DNA49948. 

Figure 243 shows a nucleotide 
DNA49956. 

Figure 244 shows a nucleotide 
25 DNA49992. 

Figure 245 shows a nucleotide 
DNA50307. 

Figure 246 shows a nucleotide 
DNA50319. 
30 Figure 247 shows a nucleotide 

DNA50346. 




sequence (SEQ ID NO:232) designated herein as 
sequence (SEQ ID NO:233) designated herein as 
sequence (SEQ ID NO:234) designated herein as 
sequence (SEQ ID NO:235) designated herein as 
sequence (SEQ ID NO:236) designated herein as 
sequence (SEQ ID NO:237) designated herein as 
sequence (SEQ ID NO:238) designated herein as 
sequence (SEQ ID NO:239) designated herein as 
sequence (SEQ ID NO:240) designated herein as 
sequence (SEQ ID NO:241) designated herein as 
sequence (SEQ ID NO: 242) designated herein as 
sequence (SEQ ID NO:243) designated herein as 
sequence (SEQ ID NO: 244) designated herein as 
sequence (SEQ ID NO:245) designated herein as 
sequence (SEQ ID NO:246) designated herein as 
sequence (SEQ ID NO:247) designated herein as 
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• 










• 










Figure 


248 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO:248) 


designated 


herein 


as 




DNA50354. 


























Figure 


249 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO:249) 


designated 


herein 


as 




DNA50356. 


























Figure 


250 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO:250) 


designated 


herein 


as 


5 


DNA50405. 


























Figure 


251 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO:251) 


designated 


herein 


as 




DNA50421. 


























Figure 


252 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO:252) 


designated 


herein 


as 




DNA50423. 
























10 


Figure 


253 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO:253) 


designated 


herein 


as 




DNA50527. 


























Figure 


254 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO:254) 


designated 


herein 


as 




DNA50584. 


























Figure 


255 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO:255) 


designated 


herein 


as 


15 


DNA50626. 


























Figure 


256 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO:256) 


designated 


herein 


as 




DNA50637. 


























Figure 


257 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO:257) 


designated 


herein 


as : 




DNA50650. 
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Figure 


258 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO:258) 


designated 


herein 


as 




DNA50674. 


























Figure 


259 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO:259) 


designated 


herein 


as 




DNA50675. 


























Figure 


260 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO:260) 


designated 


herein 


as 


25 


DNA50698. 


























Figure 


261 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO:261) 


designated 


herein 


as 




DNA50730. 


























Figure 


262 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO:262) 


designated 


herein 


as 




DNA50737. 
























30 


Figure 


263 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO:263) 


designated 


herein 


as 




DN AS 1003. 
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Figure 264 shows a nucleotide sequence (SEQ ID NO:264) designated herein as 
DNA5 ] 010. 

Figure 265 shows a nucleotide sequence (SEQ ID NO:265) designated herein as 
DN A3 1059. 

Figure 260 shows a nucleotide sequence (SEQ ID NO:266) designated herein as 
5 DNA51413. 

Figure 267 shows a nucleotide sequence (SEQ ID NO:267) designated herein as 
DNA51712. 

Figure 268 shows a nucleotide sequence (SEQ ID NO:268) designated herein as 
DNA51795. 

10 Figure 26 l > shows a nucleotide sequence (SEQ ID NO:269) designated herein as 
DNA52199. 

Figure 270 shows a nucleotide sequence (SEQ ID NO:270) designated herein as 
DNA52218. 

Figure 27 1 shows a nucleotide sequence (SEQ ID NO:27l) designated herein as 
15 DNA52352. 

Figure 272 shows a nucleotide sequence (SEQ ID NO:272) designated herein as 
DNA54440. 

Figure 273 shows a nucleotide sequence (SEQ ID NO:273) designated herein as 
DNA54552. 

20 Figure 274 shows a nucleotide sequence (SEQ ID NO:274) designated herein as 
DNA54580. 

Figure 215 shows a nucleotide sequence (SEQ ID NO:275) designated herein as 
DNA54623. 

Figure 276 shows a nucleotide sequence (SEQ ID NO:276) designated herein as 
25 DNA54672. 

Figure 277 shows a nucleotide sequence (SEQ ID NO:277) designated herein as 
DNA54840. 

Figure 278 shows a nucleotide sequence (SEQ ID NO:278) designated herein as 
DNA54856. 

30 Figure 279 shows a nucleotide sequence (SEQ ID NO:279) designated herein as 
DNA54882. 



22 





Figure 


280 


4 

shows 


» 

a 


nucleotide 


sequence 


(SEQ 


ID 


• 

NO:280) 


designated 


herein 


as 




DNA54943. 


























Figure 


281 


shows 


a 


nucleotide 


sequence 


(Sf;q 


ID 


NO:281 ) 


designated 


herein 


as 




DNA54970. 


























Figure 


282 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO:282) 


designated 


herein 


as 


5 


DNA55I34. 


























Figure 


283 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO:283) 


designated 


herein 


as 




DNA55198. 


























Figure 


284 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO:284) 


designated 


herein 


as 




DNA55199. 
























10 


Figure 


285 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO:285) 


designated 


herein 


as 




DNA55292. 


























Figure 


286 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO:286) 


designated 


herein 


as 




DNA55646. 


























Figure 


287 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO:287) 


designated 


herein 


as 


15 


DNA56553. 


























Figure 


288 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO:288) 


designated 


herein 


as 




DNA56554. 


























Figure 


289 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO:289) 


designated 


herein 


as 




DNA56556. 
























20 


Figure 


290 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO:290) 


designated 


herein 


as 




DNA56587. 


























Figure 


291 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO;291) 


designated 


herein 


as 




DNA56590. 


























Figure 


292 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO:292) 


designated 


herein 


as 


25 


DNA56600. 


























Figure 


293 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO:293) 


designated 


herein 


as 




DNA56648. 


























Figure 


294 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO:294) 


designated 


herein 


as 




DNA56650. 
























30 


Figure 


295 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO:295) 


designated 


herein 


as 




DNA56707. 
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Figure 296 shows a nucleotide sequence (SEQ ID NO:296) designated herein as 
DNA56717. 

Figure 297 shows a nucleotide sequence (SEQ ID NO:297) designated herein as 
DNA58387. 

Figure 298 shows a nucleotide sequence (SEQ ID NO:298) designated herein as 
5 DNA58414. 

Figure 299 shows a nucleotide sequence (SEQ ID NO:299) designated herein as 
DNA58529. 

Figure 300 shows a nucleotide sequence (SEQ ID NO:3()0) designated herein as 
DNA59385. 

10 Figure 301 shows a nucleotide sequence (SEQ ID NO:3()l) designated herein as 

DNA5<>789. 

Figure 302 shows a nucleotide sequence (SEQ ID NO:302) designated herein as 
DNAO0321. 

Figure 303 shows a nucleotide sequence (SEQ ID NO:303) designated herein as 
15 DNA60370. 

Figure 304 shows a nucleotide sequence (SEQ ID NO:304) designated herein as 
DNA60406. 

Figure 305 shows a nucleotide sequence (SEQ ID NO:3()5) designated herein as 
DNAft()438. 

20 Figure 306 shows a nucleotide sequence (SEQ ID NO:306) designated herein as 

DNA60460. 

Figure 307 shows a nucleotide sequence (SEQ ID NO:307) designated herein as 
DNA60466. 

Figure 308 shows a nucleotide sequence (SEQ ID NO: 308) designated herein as 
25 DNA60508. 

Figure 309 shows a nucleotide sequence (SEQ ID NO:309) designated herein as 
DNA60542. 

Figure 310 shows a nucleotide sequence (SEQ ID NO:3I0) designated herein as 
DNA60590. 

30 Figure 311 shows a nucleotide sequence (SEQ ID NO:311) designated herein as 

DNA61350. 
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Figure 312 shows a nucleotide sequence (SEQ ID NO:312) designated herein as 
DNA61356. 

Figure 313 shows a nucleotide sequence (SEQ ID NO:313) designated herein as 
DNA61478. 

Figure 314 shows a nucleotide sequence (SEQ ID NO:314) designated herein as 
5 DNA61513. 

Figure 315 shows a nucleotide sequence (SEQ ID NO:315) designated herein as 
DNA61561. 

Figure 316 shows a nucleotide sequence (SEQ ID NO:316) designated herein as 
DNA61895. 

10 Figure 3I7 shows a nucleotide sequence (SEQ ID NO:3l7) designated herein as 

DNA61930. 

Figure 3 18 shows a nucleotide sequence (SEQ ID NO:318) designated herein as 
DNA61953. 

Figure 319 shows a nucleotide sequence (SEQ ID NC):319) designated herein as 
15 DNA62011. 

Figure 320 shows a nucleotide sequence (SEQ ID NO: 320) designated herein as 
DNA62080. 

Figure 321 shows a nucleotide sequence (SEQ ID NO:321) designated herein as 
DNA62126. 

20 Figure 322 shows a nucleotide sequence (SEQ ID NO:322) designated herein as 

DNA62154. 

Figure 323 shows a nucleotide sequence (SEQ ID NO:323) designated herein as 
DNAO2170. 

Figure 324 shows a nucleotide sequence (SEQ ID NO:324) designated herein as 
25 DNA62193. 

Figure 325 shows a nucleotide sequence (SEQ ID NO:325) designated herein as 
DNA02261. 

Figure 326 shows a nucleotide sequence (SEQ ID NO:326) designated herein as 
DNA62291. 

30 Figure 327 shows a nucleotide sequence (SEQ ID NO:327) designated herein as 

DNA62422. 

25 




Figure 328 shows a nucleotide sequence (SEQ ID NO:328) designated herein as 
DNA6243o. 

Figure 329 shows a nucleotide sequence (SEQ ID NO:329) designated herein as 
DNA62524. 

Figure 330 shows a nucleotide sequence (SEQ ID NO:330) designated herein as 
5 DNA62589. 

Figure 331 shows a nucleotide sequence (SEQ ID NO:331) designated herein as 
DNA63878. 

Figure 332 shows a nucleotide sequence (SEQ ID NO:332) designated herein as 
DNA64017. 

10 Figure 333 shows a nucleotide sequence (SEQ ID NO:333) designated herein as 
DNA64045. 

Figure 334 shows a nucleotide sequence (SEQ ID NO:334) designated herein as 
DNA64101. 

Figure 335 shows a nucleotide sequence (SEQ) ID NO:335) designated herein as 
15 DNA641 83. 

Figure 336 shows a nucleotide sequence (SEQ ID NO:336) designated herein as 
DNA64193. 

Figure 337 shows a nucleotide sequence (SEQ ID NO:337) designated herein as 
DNA64I99. 

20 Figure 338 shows a nucleotide sequence (SEQ ID NO:338) designated herein as 
DNA04268. 

Figure 339 shows a nucleotide sequence (SEQ ID NO: 339) designated herein as 
DNA64304. 

Figure 340 shows a nucleotide sequence (SEQ ID NO:340) designated herein as 
25 DNA64453. 

Figure 341 shows a nucleotide sequence (SEQ ID NO:341) designated herein as 
DNA64458. 

Figure 342 shows a nucleotide sequence (SEQ ID NO:342) designated herein as 
DNA645I2. 

30 Figure 343 shows a nucleotide sequence (SEQ ID NO:343) designated herein as 
DNA64540. 



26 



Figure 344 shows a nucleotide sequence (SEQ ID NO:344) designated herein as 
DNA64552. 

Figure 345 shows a nucleotide sequence (SEQ ID NO:345) designated herein as 
DNA64557. 

Figure 346 shows a nucleotide sequence (SEQ ID NO:346) designated herein as 
5 DNA64569. 

Figure 347 shows a nucleotide sequence (SEQ ID NO:347) designated herein as 
DNA64627. 

Figure 348 shows a nucleotide sequence (SEQ ID NO:348) designated herein as 
DNA64745. 

10 Figure 349 shows a nucleotide sequence (SEQ ID NO:349) designated herein as 

DNA64784. 

Figure 350 shows a nucleotide sequence (SEQ ID NO: 350) designated herein as 
DNA65609. 

Figure 351 shows a nucleotide sequence (SEQ ID NO:351) designated herein as 
15 DNA65644. 

Figure 352 shows a nucleotide sequence (SEQ ID NO:352) designated herein as 
DNA65720. 

Figure 353 shows a nucleotide sequence (SEQ ID NO: 353) designated herein as 
DNA65752. 

20 Figure 354 shows a nucleotide sequence (SEQ ID NO:354) designated herein as 

DNA65771. 

Figure 355 shows a nucleotide sequence (SEQ ID NO: 355) designated herein as 
DNA05833. 

Figure 356 shows a nucleotide sequence (SEQ ID NO: 356) designated herein as 
25 DNA65836. 

Figure 357 shows a nucleotide sequence (SEQ ID NO:357) designated herein as 
DNA65864. 

Figure 358 shows a nucleotide sequence (SEQ ID NO:358) designated herein as 
DNA65869. 

30 Figure 359 shows a nucleotide sequence (SEQ ID NO:359) designated herein as 

DNA65928. 
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Figure 360 shows a nucleotide 
DNA66065. 

Figure 361 shows a nucleotide 
DNA66095. 

Figure 362 shows a nucleotide 
5 DNA66197. 

Figure 363 shows a nucleotide 
DNA66217. 

Figure 364 shows a nucleotide 
DNA66231. 
10 Figure 365 shows a nucleotide 

DNA66404. 

Figure 366 shows a nucleotide 
DNA66432. 

Figure 367 shows a nucleotide 
15 DNA67076. 

Figure 368 shows a nucleotide 
DNA68013. 

Figure 369 shows a nucleotide 
DNA68018. 
20 Figure 370 shows a nucleotide 

DNA68034. 

Figure 371 shows a nucleotide 
DNA08I 19. 

Figure 372 shows a nucleotide 
25 DNA68248. 

Figure 373 shows a nucleotide 
DNA68383. 

Figure 374 shows a nucleotide 
DNA68423. 

30 Figure 375 shows a nucleotide 

DNA68441. 




sequence (SEQ ID NO:360) designated herein as 

sequence (SEQ ID NO:361) designated herein as 

sequence (SEQ ID NO: 3 62) designated herein as 

sequence (SEQ ID NO:363) designated herein as 

sequence (SEQ ID NO:364) designated herein as 

sequence (SEQ ID NO: 365) designated herein as 

sequence (SEQ ID NO:366) designated herein as 

sequence (SEQ ID NO:367) designated herein as 

sequence (SEQ ID NO:368) designated herein as 

sequence (SEQ ID NO:369) designated herein as 

sequence (SEQ ID NO:370) designated herein as 

sequence (SEQ ID NO:371) designated herein as 

sequence (SEQ ID NO: 372) designated herein as 

sequence (SEQ ID NO: 373) designated herein as 

sequence (SEQ ID NO: 374) designated herein as 

sequence (SEQ ID NO:375) designated herein as 
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Figure 376 shows a nucleotide sequence (SEQ ID NO:376) designated herein as 
DNA68459. 

Figure 377 shows a nucleotide sequence (SEQ ID NO:377) designated herein as 
DNA68509. 

Figure 37<S shows a nucleotide sequence (SEQ ID NO:378) designated herein as 
5 DNA685I4. 

Figure 379 shows a nucleotide sequence (SEQ ID NO:379) designated herein as 
DNA08521. 

Figure 380 shows a nucleotide sequence (SEQ ID NO:380) designated herein as 
DNA68532. 

10 Figure 381 shows a nucleotide sequence (SEQ ID NO:381) designated herein as 

DNA68540. 

Figure 382 shows a nucleotide sequence (SEQ ID NO:382) designated herein as 
DNA68561. 

Figure 383 shows a nucleotide sequence (SEQ ID NO: 383) designated herein as 
15 DNA68585. 

Figure 384 shows a nucleotide sequence (SEQ ID NO:384) designated herein as 
DNA69491. 

Figure 385 shows a nucleotide sequence (SEQ ID NO:385) designated herein as 
DNA70222. 

20 Figure 386 shows a nucleotide sequence (SEQ ID NO:386) designated herein as 

DNA70239. 

Figure 387 shows a nucleotide sequence (SEQ ID NO:387) designated herein as 
DNA70244. 

Figure 388 shows a nucleotide sequence (SEQ ID NO:388) designated herein as 
25 DNA70349. 

Figure 389 shows a nucleotide sequence (SEQ ID NO:389) designated herein as 
DNA70400. 

Figure 390 shows a nucleotide sequence (SEQ ID NO: 390) designated herein as 
DNA70413. 

30 Figure 391 shows a nucleotide sequence (SEQ ID NO:391) designated herein as 

DNA70526. 
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Figure 392 shows a nucleotide sequence (SEQ ID NO:392) designated herein as 
DNA70685. 

Figure 393 shows a nucleotide sequence (SEQ ID NO:393) designated herein as 
DNA70732. 

Figure 3 C >4 shows a nucleotide sequence (SEQ ID NO 3 l »4) designated herein as 
5 DNA72634. 

Figure 395 shows a nucleotide sequence (SEQ ID NO:395) designated herein as 
DNA72683. 

Figure 3 l >6 shows a nucleotide sequence (SEQ ID NO:396) designated herein as 
DNA72695. 

10 Figure 397 shows a nucleotide sequence (SEQ ID NO:397) designated herein as 

DNA72864. 

Figure 398 shows a nucleotide sequence (SEQ ID NO 3 ( >8) designated herein as 
DNA73156. 

Figure 399 shows a nucleotide sequence (SEQ ID NO:3 l >9) designated herein as 
15 DNA73275. 

Figure 400 shows a nucleotide sequence (SEQ ID NO 400) designated herein as 
DNA74052. 

Figure 401 shows a nucleotide sequence (SEQ ID NO:401) designated herein as 
DNA74063. 

20 Figure 402 shows a nucleotide sequence (SEQ ID NO:4()2) designated herein as 

DNA74072. 

Figure 403 shows a nucleotide sequence (SEQ ID NO. 403) designated herein as 
DNA74I40. 

Figure 404 shows a nucleotide sequence (SEQ ID NO:4()4) designated herein as 
25 DNA742I6. 

Figure 405 shows a nucleotide sequence (SEQ ID NO:4()5) designated herein as 
DNA74218. 

Figure 406 shows a nucleotide sequence (SEQ ID NO:406) designated herein as 
DNA74228. 

30 Figure 407 shows a nucleotide sequence (SEQ ID NO:407) designated herein as 

DNA74256. 
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Figure 408 shows a nucleotide sequence (SEQ ID NO 408) designated herein as 
DNA750A2. 

Figure 409 shows a nucleotide sequence (SEQ ID NO 409) designated herein as 
DNA76137. 

Figure 410 shows a nucleotide sequence (SEQ ID NO:410) designated herein as 
5 DNA76158. 

Figure 411 shows a nucleotide sequence (SEQ ID NO.411) designated herein as 
DNA77098. 

Figure 412 shows a nucleotide sequence (SEQ ID NO:412) designated herein as 
DNA77791. 

10 Figure 41 3 shows a nucleotide sequence (SEQ ID NO:413) designated herein as 
DNA77968. 

Figure 414 shows a nucleotide sequence (SEQ ID NO:414) designated herein as 
DNA77976. 

Figure 415 shows a nucleotide sequence (SEQ ID NO:415) designated herein as 
15 DNA78017. 

Figure 4 16 shows a nucleotide sequence (SEQ ID NO:416) designated herein as 
DNA78095. 

Figure 417 shows a nucleotide sequence (SEQ ID NO:417) designated herein as 
DNA78I03. 

20 Figure 418 shows a nucleotide sequence (SEQ ID NO:418) designated herein as 
DNA781 13. 

Figure 419 shows a nucleotide sequence (SEQ ID NO:419) designated herein as 
DNA78746. 

Figure 420 shows a nucleotide sequence (SEQ ID NO:420) designated herein as 
25 DNA78759. 

Figure 421 shows a nucleotide sequence (SEQ ID NO:421) designated herein as 
DNA78796. 

Figure 422 shows a nucleotide sequence (SEQ ID NO:422) designated herein as 
DNA79561. 

30 Figure 423 shows a nucleotide sequence (SEQ ID NO:423) designated herein as 
DNA79602. 

31 





Figure 


424 


4 

shows 


I 

a 


nucleotide 


sequence 


(SFQ 


ID 


• 

NO:424) 


designated 


herein 


as 




DNA7W17. 


























Figure 


425 


shows 


a 


nucleotide 


sequence 


(SBQ 


ID 


NO:425) 


designated 


herein 


as 




DNA7<>(>28. 


























Figure 


426 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO;426) 


designated 


herein 


as ! 


5 


DNA7 ( >640. 


























Figure 


427 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO:427) 


designated 


herein 


as 




DNA79661. 


























Figure 


428 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO:428) 


designated 


herein 


as 




DNA7<>684. 
























10 


Figure 


429 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO:429) 


designated 


herein 


as 




DNA79717. 


























Figure 


430 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO 430) 


designated 


herein 


as j 




DNA79733. 


























Figure 


431 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO:431) 


designated 


herein 


as 


15 


DNA7<><>7(). 


























Figure 432 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO 432) 


designated 


herein 


as 




DNA80050. 


























Figure 433 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO:433) 


designated 


herein 


as 




DNA80247. 
























20 


Figure 


434 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO-434) 


designated 


herein 


as 




DNA80265. 


























Figure 


435 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO 435) 


designated 


herein 


as 




DNA80615. 


























Figure 


436 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO:436) 


designated 


herein 


as 


25 


DNA80623. 


























Figure 437 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO:437) 


designated 


herein 


as 




DNA80627. 


























Figure 438 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO. 438) 


designated 


herein 


as 




DNA81896. 
























30 


Figure 


439 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO:439) 


designated 


herein 


as 




DNA81918. 
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Figure 440 shows a nucleotide sequence (SE:Q ID NO:440) designated herein as 
DNA81976. 

Figure 441 shows a nucleotide sequence (SEQ ID NO:441) designated herein as 
DNA82017. 

Figure 442 shows a nucleotide sequence (SEQ ID NO:442) designated herein as 
5 DNA82024. 

Figure 443 shows a nucleotide sequence (SEQ ID NO:443) designated herein as 
DNA82027. 

Figure 444 shows a nucleotide sequence (SEQ ID NO:444) designated herein as 
DNA821 15. 

10 Figure 445 shows a nucleotide sequence (SEQ ID NO:445) designated herein as 
DNA82154. 

Figure 446 shows a nucleotide sequence (SEQ ID NO:446) designated herein as 
DNA82157. 

Figure 447 shows a nucleotide sequence (SEQ ID NO:447) designated herein as 
15 DNA82166. 

Figure 448 shows a nucleotide sequence (SEQ ID NO:448) designated herein as 
DNA82182. 

Figure 449 shows a nucleotide sequence (SEQ ID NO:449) designated herein as 
DNA82212. 

20 Figure 450 shows a nucleotide sequence (SEQ ID NO:450) designated herein as 
DNA82498. 

Figure 451 shows a nucleotide sequence (SEQ ID NO:451) designated herein as 
DNA82499. 

Figure 452 shows a nucleotide sequence (SEQ ID NO:452) designated herein as 
25 DNA82504. 

Figure 453 shows a nucleotide sequence (SEQ ID NO:453) designated herein as 
DNA82531. 

Figure 454 shows a nucleotide sequence (SEQ ID NO:454) designated herein as 
DNA82693. 

30 Figure 455 shows a nucleotide sequence (SEQ ID NO:455) designated herein as 
DNA82702. 
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Figure 


450 
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shows 
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nueleotide 


sequence 


(SEQ 


ID 


• 

NO:456) designated 


herein 


as 




DNA82786. 
























Figure 


457 


shows 
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nucleotide 


sequence 


(SEQ 


ID 


NG 457) designated 


herein 


as 




DNA8285L 
























Figure 


458 


shows 
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nucleotide 


sequence 


(SFQ 


ID 


NO:458) designated 


herein 


as 


5 


DNA82898. 
























Figure 


459 


shows 
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nucleotide 


sequence 


(SEQ 


ID 


NO:459) designated 


herein 


as 




DNA82935. 
























Figure 


460 


shows 
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nucleotide 


sequence 


(SEQ 


ID 


NO:460) designated 


herein 


as 




DNA82977. 
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Figure 


461 


shows 


a 


nucleotide 


sequence 


(SEQ 


ID 


NO:461) designated 


herein 


as 




DNA82989. 
























Figure 


462 


shows 
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nucleotide 


sequence 


(SEQ 


ID 


NO:462) designated 


herein 


as 




DNA83628. 
























Figure 


463 


shows 
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nucleotide 


sequence 


(SEQ 


ID 


NO:4o3) designated 


herein 


as 


15 


DNA83630. 
























Figure 


464 


shows 
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nucleotide 


sequence 


(SEQ 


ID 


NO:464) designated 


herein 


as 




DNA83749. 
























Figure 


405 


shows 
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nucleotide 


sequence 


(SEQ 


ID 


NO:465) designated 


herein 


as 




DNA83772. 
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Figure 


466 


shows 
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nucleotide 


sequence 


(SEQ 


ID 


NO:4o6) designated 


herein 


as 




DNA83800. 
























Figure 


467 


shows 
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nucleotide 


sequence 


(SEQ 


ID 


NO;4o7) designated 


herein 


as 




DNA83950. 
























Figure 


468 


shows 
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nucleotide 


sequence 


(SEQ 


ID 


NO:468) designated 


herein 


as 


25 


DNA84027. 
























Figure 


409 


shows 
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nucleotide 


sequence 


(SEQ 


ID 


NO:469) designated 


herein 


as 




DNA84076. 
























Figure 


470 


shows 
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nucleotide 


sequence 


(SEQ 


ID 


NO:470) designated 


herein 


as 




DNA84109. 
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Figure 


471 


shows 
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nucleotide 


sequence 


(SEQ 


ID 


NO:471) designated 


herein 


as 




DNA85072. 
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Figure 472 shows a nucleotide sequence (SEQ ID NO:472) designated herein as 
DNA85154. 

Figure 473 shows a nucleotide sequence (SEQ) JD NO:473) designated herein as 
DNA85193. 

Figure 474 shows a nucleotide sequence (SEQ ID NO:474) designated herein as 
5 DNA85224. 

Figure 475 shows a nucleotide sequence (SEQ ID NO:475) designated herein as 
DNA85237. 

Figure 476 shows a nucleotide sequence (SEQ ID NO:476) designated herein as 
DNA85289. 

10 Figure 477 shows a nucleotide sequence (SEQ ID NO:477) designated herein as 

DNA85357. 

Figure 478 shows a nucleotide sequence (SEQ ID NO:478) designated herein as 
DNA8536L 

Figure 479 shows a nucleotide sequence (SEQ ID NO:479) designated herein as 
15 DNA85371. 

Figure 480 shows a nucleotide sequence (SEQ ID NO:480) designated herein as 
DNA86875. 

Figure 481 shows a nucleotide sequence (SEQ ID NO:481) designated herein as 
DNA86876. 

20 Figure 482 shows a nucleotide sequence (SEQ ID NO:482) designated herein as 

DNA86905. 

Figure 483 shows a nucleotide sequence (SEQ ID NO:483) designated herein as 
DNA86945. 

Figure 484 shows a nucleotide sequence (SEQ ID NO:484) designated herein as 
25 DNA86969. 

Figure 485 shows a nucleotide sequence (SEQ ID NO:485) designated herein as 
DNA87050. 

Figure 486 shows a nucleotide sequence (SEQ ID NO:486) designated herein as 
DNA87094. 

30 Figure 487 shows a nucleotide sequence (SEQ ID NO:487) designated herein as 

DNA87126. 



35 




Figure 488 shows a nucleotide sequence (SEQ ID NO:488) designated herein as 
DNA87493. 

Figure 489 shows a nucleotide sequence (SEQ ID NO:489) designated herein as 
DNA87494. 

Figure 490 shows a nucleotide sequence (SEQ ID NO:49()) designated herein as 
5 DNA87505. 

Figure 491 shows a nucleotide sequence (SEQ ID NO:491) designated herein as 
DNA87566. 

Figure 4 l >2 shows a nucleotide sequence (SEQ ID NO:492) designated herein as 
DNA87586. 

10 Figure 4 4 >3 shows a nucleotide sequence (SEQ ID NO:493) designated herein as 
DNA87649. 

Figure 4^4 shows a nucleotide sequence (SEQ ID NO:494) designated herein as 
DNA89340. 

Figure 4 C >5 shows a nucleotide sequence (SEQ ID NO:495) designated herein as 
15 DNA89355. 

Figure 496 shows a nucleotide sequence (SEQ ID NO:496) designated herein as 
DNA89365. 

Figure 497 shows a nucleotide sequence (SEQ ID NO:497) designated herein as 
DNA89419. 

20 Figure 498 shows a nucleotide sequence (SEQ ID NO:498) designated herein as 
DNA89470. 

Figure 499 shows a nucleotide sequence (SEQ ID NO:499) designated herein as 
DNA89480. 

Figure 500 shows a nucleotide sequence (SEQ ID NO:500) designated herein as 
25 DNA89549. 

Figure 50 1 shows a nucleotide sequence (SEQ ID NO:501) designated herein as 
DNA8<>606. 

Figure 502 shows a nucleotide sequence (SEQ ID NO:502) designated herein as 
DNA89615. 

30 Figure 503 shows a nucleotide sequence (SEQ ID NO:5()3) designated herein as 
DNA89669. 

36 




Figure 504 shows a nucleotide sequence (SEQ ID NO:5()4) designated herein as 
DNA89760. 

Figure 505 shows a nucleotide sequence (SEQ ID NO:5()5) designated herein as 
DNA89766. 

Figure 506 shows a nucleotide sequence (SEQ ID NO: 506) designated herein as 
5 DNA89772. 

Figure 507 shows a nucleotide sequence (SEQ ID NO:507) designated herein as 
DNA89773. 

Figure 508 shows a nucleotide sequence (SEQ ID NO:508) designated herein as 
DNA89774. 

10 Figure 500 shows a nucleotide sequence (SEQ ID NO:5()9) designated herein as 
DNA80872. 

Figure 510 shows a nucleotide sequence (SEQ ID NO:510) designated herein as 
DNA89918. 

Figure 51 1 shows a nucleotide sequence (SEQ ID NO:511) designated herein as 
15 DNA89928. 

Figure 5I2 shows a nucleotide sequence (SEQ ID NO:512) designated herein as 
DNA89930. 

Figure 513 shows a nucleotide sequence (SEQ ID NO:513) designated herein as 
DNA91463. 

20 Figure 514 shows a nucleotide sequence (SEQ ID NO:514) designated herein as 
DNA<>1507. 

Figure 515 shows a nucleotide sequence (SEQ ID NO:515) designated herein as 
DNA<>3615. 

Figure 516 shows a nucleotide sequence (SEQ ID NO:516) designated herein as 
25 DNA04011. 

Figure 517 shows a nucleotide sequence (SEQ ID NO:517) designated herein as 
DNA94043. 

Figure 518 shows a nucleotide sequence (SEQ ID NO:518) designated herein as 
DNA94050. 

30 Figure 519 shows a nucleotide sequence (SEQ ID NO:519) designated herein as 
DNA94097. 
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Figure 520 shows a nucleotide sequence (SEQ II) NO:520) designated herein as 
DNA94098. 

Figure 521 shows a nucleotide sequence (SEQ ID NO:521) designated herein as 
DNA94100. 

Figure 522 shows a nucleotide sequence (SEQ ID NO:522) designated herein as 
5 DNA94126. 

Figure 523 shows a nucleotide sequence (SEQ ID NO:523) designated herein as 
DNA94136. 

Figure 524 shows a nucleotide sequence (SFQ ID NO:524) designated herein as 
DNA04156. 

10 Figure 525 shows a nucleotide sequence (SEQ ID NO:525) designated herein as 
DNA94219. 

Figure 526 shows a nucleotide sequence (SEQ ID NO:526) designated herein as 
DNA^4254. 

Figure 527 shows a nucleotide sequence (SEQ ID NO:527) designated herein as 
15 DNA94274. 

Figure 528 shows a nucleotide sequence (SEQ ID NO:528) designated herein as 
DNA94292. 

Figure 529 shows a nucleotide sequence (SEQ ID NO:529) designated herein as 
DNA94360. 

20 Figure 530 shows a nucleotide sequence (SEQ ID NO:530) designated herein as 
DNA94377. 

Figure 531 shows a nucleotide sequence (SEQ ID NO:531) designated herein as 
DNA94477. 

Figure 532 shows a nucleotide sequence (SEQ ID NO:532) designated herein as 
25 DNA945I8. 

Figure 533 shows a nucleotide sequence (SEQ ID NO:533) designated herein as 
DNA94533. 

Figure 534 shows a nucleotide sequence (SEQ ID NO:534) designated herein as 
DNA95370. 

30 Figure 535 shows a nucleotide sequence (SEQ ID NO:535) designated herein as 
DNA97358. 
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Figure 536 shows a nucleotide sequence (SEQ ID NO:536) designated herein as 
DNA97374. 

Figure 537 shows a nucleotide sequence (SEQ ID NO: 537) designated herein as 
DNA97470. 

Figure 538 shows a nucleotide sequence (SEQ ID NO:538) designated herein as 
5 DN/V>7581. 

Figure 539 shows a nucleotide sequence (SEQ ID NO:539) designated herein as 
DN/V>7767. 

Figure 540 shows a nucleotide sequence (SEQ ID NO:540) designated herein as 
DN,V>7842. 

10 Figure 54 1 shows a nucleotide sequence (SEQ ID NO:541) designated herein as 
DNA<>7949. 

Figure 542 shows a nucleotide sequence (SEQ ID NO: 542) designated herein as 
DNA^7987. 

Figure 543 shows a nucleotide sequence (SEQ ID NO:543) designated herein as 
15 DNA<>7995 

Figure 544 shows a nucleotide sequence (SEQ ID NO:544) designated herein as 
DNA ( >82<>3. 

Figure 545 shows a nucleotide sequence (SEQ ID NO:545) designated herein as 
DNA<>8294. 

20 Figure 546 shows a nucleotide sequence (SEQ ID NO:546) designated herein as 
DNA<>8346. 

Figure 547 shows a nucleotide sequence (SEQ ID NO:547) designated herein as 
DNA<>8360. 

Figure 548 shows a nucleotide sequence (SEQ ID NO:548) designated herein as 
25 DNA98829. 

Figure 549 shows a nucleotide sequence (SEQ ID NO:549) designated herein as 
DNAI01514. 

Figure 550 shows a nucleotide sequence (SEQ ID NO:550) designated herein as 
DNA101572. 

30 Figure 55 1 shows a nucleotide sequence (SEQ ID NO:55l) designated herein as 
DNA101580. 



39 




Figure 552 shows a nucleotide sequence (SFQ ID NO:552) designated herein as 
DN A 101595. 

Figure 553 shows a nucleotide sequence (SEQ ID NO:553) designated herein as 
DNA101633. 

Figure 554 shows a nucleotide sequence (SEQ ID NO:554) designated herein as 
5 DNAI01717. 

Figure 555 shows a nucleotide sequence (SEQ ID NO:555) designated herein as 
DNA101768. 

Figure 556 shows a nucleotide sequence (SEQ ID NO:556) designated herein as 
DNA 107332. 

10 Figure 557 shows a nucleotide sequence (SEQ ID NO:557) designated herein as 

DNA43499. 

Figure 558 shows a nucleotide sequence (SEQ ID NO:558) designated herein as 
DNA45713. 

Figure 559 shows a nucleotide sequence (SEQ ID NO:559) designated herein as 
15 DNA46089. 

Figure 560 shows a nucleotide sequence (SEQ ID NO: 560) designated herein as 
DNA68256. 

Figure 561 shows a nucleotide sequence (SEQ ID NO:561) designated herein as 
DNA70305. 

20 Figure 562 shows a nucleotide sequence (SEQ ID NO:562) designated herein as 

DNA82953. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
I. Definitions 

25 The term "SRT polypeptide" when used herein encompasses "native sequence SRT 

polypeptides'' and "SRT polypeptide variants" (which are further defined herein). "SRT" is a 
designation given to those polypeptides which arc encoded by the nucleic acid molecules shown 
in the accompanying figures and variants thereof, nucleic acid molecules comprising the 
sequence shown in the accompanying figures and variants thereof as well as fragments of the 

30 above. The SRT polypeptides of the invention may be isolated from a variety of sources, such 
as from human tissue types or from another source, or prepared by recombinant and/or synthetic 
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methods. 

A "native sequence" SRT polypeptide comprises a polypeptide having the same amino 
acid sequence as the corresponding SRT polypeptide derived from nature. Such native sequence 
SRT polypeptides can be isolated from nature or can be produced by recombinant and/or 
synthetic means. The term "native sequence SRT polypeptide" specifically encompasses 
5 naturally-occurring truncated or secreted forms (e.g , an extracellular domain sequence), 
naturally-occurring variant forms (e.g., alternatively spliced forms) and naturally-occurring allelic 
variants of the polypeptide. 

An SRT polypeptide "extracellular domain"' or "ECD" refers to a form of the SRT 
polypeptide which is essentially free of the transmembrane and cytoplasmic domains. Ordinarily, 

10 an SRT polypeptide ECD will have less than about 1% of such transmembrane and/or 
cytoplasmic domains and preferably, w ill have less than about 0.5% of such domains. It will be 
understood that any transmembrane domain(s) identified for the SRT polypeptides of the present 
invention are identified pursuant to criteria routinely employed in the art for identifying that type 
of hydrophobic domain. The exact boundaries of a transmembrane domain may vary but most 

1 5 likely by no more than about 5 amino acids at either end of the domain as initially identified. 

"Variant SRT polypeptide" means an active SRT polypeptide as defined below having 
at least about 80% amino acid sequence identity with the amino acid sequence of a specifically 
derived fragment of any other polypeptide which will be specifically recited. Such variant SRT 
polypeptides include, for instance, SRT polypeptides wherein one or more amino acid residues 

20 are added, or deleted, at the N- and/or C-tcrrninus, as well as within one or more internal 
domains, of the full-length amino acid sequence. Ordinarily, a variant SRT polypeptide will ha\e 
at least about 80% amino acid sequence identity, more preferably at least about 8 1 % amino acid 
sequence identity, more preferably at least about 82%> amino acid sequence identity, more 
preferably at least about 83% amino acid sequence identity, more preferably at least about 84' c 

25 amino acid sequence identity, more preferably at least about 85% amino acid sequence identity, 
more preferably at least about 86% amino acid sequence identity, more preferably at least about 
87% amino acid sequence identity, more preferably at least about 88% amino acid sequence 
identity, more preferably at least about 89% amino acid sequence identity, more preferably at 
least about 90% amino acid sequence identity, more preferably at least about 91%' amino acid 

30 sequence identity, more preferably at least about 92% amino acid sequence identity, more 
preferably at least about 93% amino acid sequence identity, more preferably at least about 94% 
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amino acid sequence identity, more preferably at least about 959£ amino acid sequence identity, 
more preferably at least about 9(V/c amino acid sequence identity, more preferably at least about 
979f amino acid sequence identity, more preferably at least about 9H r /c amino acid sequence 
identity and yet more preferably at least about 999r amino acid sequence identity with an SRT 
polypeptide encoded by a nucleic acid molecule shown in one of the accompanying figures or 
5 a specified fragment thereof. SRT variant polypeptides do not encompass the native SRT 
polypeptide sequence. Ordinarily, SRT variant polypeptides are at least about 10 amino acids 
in length, often at least about 20 amino acids in length, more often at least about 30 amino acids 
in length, more often at least about 40 amino acids in length, more often at least about 50 amino 
acids in length, more often at least about 60 amino acids in length, more often at least about 70 
10 amino acids in length, more often at least about 80 amino acids in length, more often at least 
about 90 amino acids in length, more often at least about 100 amino acids in length, more often 
at least about 150 amino acids in length, more often at least about 200 amino acids in length, 
more often at least about 250 amino acids in length, more often at least about 300 amino acids 
in length, or more. 

15 "Percent (%) amino acid sequence identity" with respect to the SRT polypeptide 

sequences identified herein is defined as the percentage of amino acid residues in a candidate 
sequence that are identical with the amino acid residues in a SRT sequence, after aligning the 
sequences and introducing gaps, if necessary, to achieve the maximum percent sequence identity, 
and not considering any conservative substitutions as part of the sequence identity. Alignment 

20 for purposes of determining percent amino acid sequence identity can be achieved in various 
ways that are within the skill in the art, for instance, using publicly available computer software 
such as BLAST, BLAST-2, ALIGN. ALIGN-2 or Megalign (DNASTAR) software. Those 
skilled in the art can determine appropriate parameters for measuring alignment, including any 
algorithms needed to achieve maximal alignment over the full-length of the sequences being 

25 compared. For purposes herein, however, % amino acid sequence identity values are obtained 
as described below by using the sequence comparison computer program ALIGN-2, wherein the 
complete source code for the ALIGN-2 program is provided in Table l . The ALIGN-2 sequence 
comparison computer program was authored by Genentcch, Inc. and the source code shown in 
Table 1 has been filed with user documentation in the U.S. Copyright Office, Washington D.C.. 

30 20559, where it is registered under U.S. Copyright Registration No. TXU5 10087. The ALIGN-2 
program is publicly available through Genentech, Inc., South San Francisco, California or may 
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be compiled from the source code provided in Table I. The ALIGN-2 program should be 
compiled lor use on a UNIX operating system, preferably digital UNIX V4.0D. All sequence 
comparison parameters are set by the ALIGN-2 program and do not vary. 

For purposes herein, the r /< amino acid sequence identity of a given amino acid sequence 
A to, with, or against a given amino acid sequence B (which can alternatively be phrased as a 
5 given amino acid sequence A that has or comprises a certain % amino acid sequence identity to, 
with, or against a given amino acid sequence B) is calculated as follows: 

100 times the fraction X/Y 

10 where X is the number of amino acid residues scored as identical matches by the sequence 
alignment program ALIGN-2 in that program's alignment of A and B, and where Y is the total 
number of amino acid residues in B. It will be appreciated that where the length of amino acid 
sequence A is not equal to the length of amino acid sequence B, the c fc amino acid sequence 
identity of A to B will not equal the % amino acid sequence identity of B to A. As examples of 

15 c c amino acid sequence identity calculations. Tables 2 and 3 demonstrate how to calculate the 
c 'c amino acid sequence identity of the amino acid sequence designated "Comparison Protein" 
to the amino acid sequence designated "PRO". 

Unless specifically stated otherwise, all 7r amino acid sequence identity values used 
herein are obtained as described above using the ALIGN-2 sequence comparison computer 

20 program. However, % amino acid sequence identity may also be determined using the sequence 
comparison program NCBI-BLAST2 (Altschul ct al.. Nucleic Acids Res. 25:3389-3402 (1997)). 
The NCBI-BLAST2 sequence comparison program may be downloaded from 
http://www.ncbi.nlm.nih.gov. NCBI-BLAST2 uses several search parameters, wherein all of 
those search parameters are set to default values including, for example, unmask = yes, strand 

25 = all, expected occurrences = 10, minimum low complexity length = 15/5, multi-pass c- value = 
0.01, constant for multi-pass = 25, dropoff for final gapped alignment = 25 and scoring matrix 
= BLOSUM62. 

In situations where NCBI-BLAST2 is employed for amino acid sequence comparisons, 
the ( 7r amino acid sequence identity of a given amino acid sequence A to, with, or against a giv en 
30 amino acid sequence B (which can alternatively be phrased as a given amino acid sequence A 
that has or comprises a certain % amino acid sequence identity to, with, or against a given amino 
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acid sequence B) is calculated as follows: 

100 times the fraction X/Y 

where X is the number of amino acid residues scored as identical matches by the sequence 
5 alignment program NCBI-BLAST2 in that program's alignment of A and B. and where Y is the 
total number of amino acid residues in B. It will be appreciated that where the length of ammo 
acid sequence A is not equal to the length of amino acid sequence B, the % amino acid sequence 
identity of A to B will not equal the % amino acid sequence identity of B to A. 

"SRT variant polynucleotide" or 4k SRT variant nucleic acid sequence" means a nucleic 

1 0 acid molecule which has at least about 80% nucleic acid sequence identity with any of the nucleic 
acid sequences shown in the accompanying figures or a specified fragment thereof. Ordinarily, 
a SRT variant polynucleotide will have at least about 80% nucleic acid sequence identity, more 
preferably at least about 81% nucleic acid sequence identity, more preferably at least about 82% 
nucleic acid sequence identity, more preferably at least about 83% nucleic acid sequence identity. 

1 5 more preferably at least about 84% nucleic acid sequence identity, more preferably at least about 
85% nucleic acid sequence identity, more preferably at least about 86' r nucleic acid sequence 
identity, more preferably at least about 87% nucleic acid sequence identity, more preferably at 
least about 88% nucleic acid sequence identity, more preferably at least about 89% nucleic acid 
sequence identity, more preferably at least about 90% nucleic acid sequence identity, more 

20 preferably at least about 9 1 % nucleic acid sequence identity, more preferably at least about 92' h 
nucleic acid sequence identity, more preferably at least about 93% nucleic acid sequence identity, 
more preferably at least about 94% nucleic acid sequence identity, more preferably at least about 
95% nucleic acid sequence identity, more preferably at least about 96% nucleic acid sequence 
identity, more preferably at least about 97% nucleic acid sequence identity, more preferably at 

25 least about 98% nucleic acid sequence identity and yet more preferably at least about 99% 
nucleic acid sequence identity with any of the nucleic acid sequences shown in the accompanying 
figures or a specified fragment thereof. SRT polynucleotide variants do not encompass the native 
SRT nucleotide sequence. 

Ordinarily, SRT variant polynucleotides are at least about 10 nucleotides in length, often 

30 at least about 1 5 nucleotides in length, often at least about 20 nucleotides in length, often at least 
about 25 nucleotides in length, often at least about 30 nucleotides in length, often at least about 
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35 nucleotides in length, often at least about 40 nucleotides in length, often at least about 45 
nucleotides in length, often at least about 50 nucleotides in length, often at least about 55 
nucleotides in length, often at least about 60 nucleotides in length, often at least about 65 
nucleotides in length, often at least about 65 nucleotides m length, often at least about 70 
nucleotides in length, often at least about 75 nucleotides in length, often at least about 80 
5 nucleotides in length, often at least about 85 nucleotides in length, often at least about 90 
nucleotides in length, often at least about 95 nucleotides in length, often at least about 100 
nucleotides in length, or more. 

"Percent (%) nucleic acid sequence identity" with respect to the SRT polypeptide- 
eneoding nucleic acid sequences identified herein is defined as the percentage of nucleotides in 

10 a candidate sequence that are identical with the nucleotides in a SRT polypeptide-encoding 
nucleic acid sequence, after aligning the sequences and introducing gaps, if necessary, to achieve 
the maximum percent sequence identity. Alignment for purposes of determining percent nucleic 
acid sequence identity can be achieved in various ways that are within the skill in the art, for 
instance, using publicly available computer software such as BLAST, BLAST-2, ALIGN, 

1 5 ALIGN-2orMegalign (DNASTAR) software. Those skilled in the art can determine appropriate 
parameters for measuring alignment, including any algorithms needed to achieve maximal 
alignment over the full-length of the sequences being compared. For purposes herein, however, 
c /c nucleic acid sequence identity values arc obtained as described below by using the sequence 
comparison computer program ALIGN-2, wherein the complete source code for the ALIGN-2 

20 program is provided in Table I . The ALIGN-2 sequence comparison computer program was 
authored by Gcnentech, Inc. and the source code shown in Table l has been filed with user 
documentation in the U.S. Copyright Office, Washington D.C., 20559, where it is registered 
under U.S. Copyright Registration No. TXU5 10087. The ALIGN-2 program is publicly available 
through Gcnentech, Inc., South San Francisco, California or may be compiled from the source 

25 code provided in Table 1. The ALIGN-2 program should be compiled for use on a UNIX 
operating system, preferably digital UNIX V4.0D. All sequence comparison parameters are set 
by the ALIGN-2 program and do not vary. 

For purposes herein, the 7c nucleic acid sequence identity of a given nucleic acid 
sequence C to, with, or against a given nucleic acid sequence D (which can alternatively be 

30 phrased as a given nucleic acid sequence C that has or comprises a certain % nucleic acid 
sequence identity to, with, or against a given nucleic acid sequence D) is calculated as follows: 
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100 times the traction W/Z 

where W is the number of nucleotides scored as identical matches by the sequence alignment 
program ALIGN -2 in that program's alignment of C and D. and where Z is the total number of 
nucleotides in D. It will be appreciated that where the length of nucleic acid sequence C is not 
5 equal to the length of nucleic acid sequence D, the % nucleic acid sequence identity of C to D 
w ill not equal the % nucleic acid sequence identity of D to C. As examples of r /r nucleic acid 
sequence identity calculations, Tables 4 and 5 demonstrate how to calculate the ( 7c nucleic acid 
sequence identity of the nucleic acid sequence designated "Comparison DNA" to the nucleic acid 
sequence designated "PRO-DNA \ 

10 Unless specifically stated otherwise, all % nucleic acid sequence identity values used 

herein are obtained as described above using the ALIGN-2 sequence comparison computer 
program. However, c /r nucleic acid sequence identity may also be determined using the sequence 
comparison program NCBI-BLAST2 ( Altschul ct al.. Nucleic Acids Res. 25:3389-3402 ( 1 997)). 
The NCBI-BLAST2 sequence comparison program may be downloaded from 

15 http://www.ncbi.nlm.nih.gov. NCBI BLAST2 uses several search parameters, wherein all of 
those search parameters are set to default values including, for example, unmask = yes, strand 
= all, expected occurrences = 10, minimum low complexity length = 15/5, multi-pass e-value = 
0.01, constant for multi-pass = 25, dropoff for final gapped alignment = 25 and scoring matrix 
= BLOSUM62. 

20 In situations w here NCBI-BLAST2 is employed for sequence comparisons, the % nucleic 

acid sequence identity of a given nucleic acid sequence C to, with, or against a given nucleic acid 
sequence D (which can alternatively be phrased as a given nucleic acid sequence C that has or 
comprises a certain ' .h nucleic acid sequence identity to, with, or against a given nucleic acid 
sequence D) is calculated as follows: 

25 

100 times the fraction W/Z 

where W is the number of nucleotides scored as identical matches by the sequence alignment 
program NCBI-BLAST2 in that program's alignment of C and D, and where Z is the total 
30 number of nucleotides in D. It will be appreciated that where the length of nucleic acid sequence 
C is not equal to the length of nucleic acid sequence D, the % nucleic acid sequence identity of 
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C to D will not equal the ( ,'< nucleic acid sequence identity of D to C. 

In other embodiments. SRT variant polynucleotides are nucleic acid molecules that 
encode an active SRT polypeptide and which are capable of hybridizing, preferably under 
stringent hybridization conditions, to any of the nucleotide sequences shown in the accompanying 
figures or their complements. SRT variant polypeptides may be those that are encoded by a SRT 
5 variant polynucleotide. 

The term "positives", in the context of the amino acid sequence identity comparisons 
performed as described above, includes amino acid residues in the sequences compared that are 
not only identical, but also those that have similar properties. Amino acid residues that score a 
positive value to an amino acid residue of interest are those that are either identical to the amino 
1 0 acid residue of interest or are a preferred substitution (as defined in Table 6 below) of the amino 
acid residue of interest. 

For purposes herein, the 9r value of positives of a given amino acid sequence A to, with, 
or against a given amino acid sequence B (which can alternatively be phrased as a given amino 
acid sequence A that has or comprises a certain c /c positives to, with, or against a given amino 
15 acid sequence B) is calculated as follows: 

100 times the fraction X/Y 

where X is the number of amino acid residues scoring a positive value as defined above by the 
20 sequence alignment program ALIGN-2 in that program's alignment of A and B, and where Y is 
the total number of amino acid residues in B. It will be appreciated that where the length of 
amino acid sequence A is not equal to the length of amino acid sequence B, the % positives of 
A to B will not equal the % positives of B to A. 

"Isolated," when used to describe the various polypeptides disclosed herein, means 
25 polypeptide that has been identified and separated and/or recovered from a component of its 
natural environment. Preferably, the isolated polypeptide is free of association with all 
components with which it is naturally associated. Contaminant components of its natural 
environment are materials that would typically interfere with diagnostic or therapeutic uses for 
the polypeptide, and may include enzymes, hormones, and other proteinaceous or non- 
30 protcinaccous solutes. In preferred embodiments, the polypeptide will be purified ( 1 ) to a degree 
sufficient to obtain at least 1 f> residues of N-tcrminal or internal amino acid sequence by use of 
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a spinning cup sequenator, or (2) to homogeneity by SDS-PAGE under non-reducing or reducing 
conditions using Coomassic blue or. preferably, silver stain. Isolated polypeptide includes 
polypeptide in situ within recombinant cells, since at least one component of the SRT natural 
environment will not be present. Ordinarily, however, isolated polypeptide will be prepared by 
at least one purification step. 
5 An "isolated" nucleic acid molecule encoding a SRT polypeptide is a nucleic acid 

molecule that is identified and separated from at least one contaminant nucleic acid molecule 
with which it is ordinarily associated in the natural source of the SRT-encoding nucleic acid. 
Preferably, the isolated nucleic is free of association with all components with which it is 
naturally associated. An isolated SRT-encoding nucleic acid molecule is other than in the form 

10 or setting in which it is found in nature. Isolated nucleic acid molecules therefore are 
distinguished from the SRT-encoding nucleic acid molecule as it exists in natural cells. 
However, an isolated nucleic acid molecule encoding a SRT polypeptide includes SRT-encoding 
nucleic acid molecules contained in cells that ordinarily express SRT where, for example, the 
nucleic acid molecule is in a chromosomal location different from that of natural cells. 

15 The term "control sequences" refers to DNA sequences necessary for the expression of 

an operably linked coding sequence in a particular host organism. The control sequences that are 
suitable for prokaryotes, for example, include a promoter, optionally an operator sequence, and 
a ribosome binding site. Eukaryotic cells arc known to utilize promoters, polyadenylation 
signals, and enhancers. 

20 Nucleic acid is "operably linked" when it is placed into a functional relationship with 

another nucleic acid sequence. For example, DNA for a presequence or secretory leader is 
operably linked to DNA for a polypeptide if it is expressed as a preprotein that participates in the 
secretion of the polypeptide; a promoter or enhancer is operably linked to a coding sequence if 
it affects the transcription of the sequence; or a ribosome binding site is operably linked to a 

25 coding sequence if it is positioned so as to facilitate translation. Generally, "operably linked" 
means that the DNA sequences being linked are contiguous, and, in the case of a secretory leader, 
contiguous and in reading phase. However, enhancers do not have to be contiguous. Linking 
is accomplished by ligation at convenient restriction sites. If such sites do not exist, the synthetic 
oligonucleotide adaptors or linkers are used in accordance with conventional practice. 

30 The term "antibody" is used in the broadest sense and specifically covers, for example, 

single anti-SRT monoclonal antibodies (including agonist, antagonist, and neutralizing 
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antibodies ).anti-SRT antibody compositions with polycpitopic specificity, single chain anti-SRT 
antibodies, and fragments of anti-SRT antibodies (see below). The term "monoclonal antibody" 
as used herein refers to an antibody obtained from a population of substantially homogeneous 
antibodies, i.e.. the individual antibodies comprising the population are identical except Im- 
possible naturally-occurring mutations that may be present in minor amounts. 
5 "Stringency" of hybridization reactions is readily determinable by one of ordinary skill 

in the art, and generally .s an empirical calculation dependent upon probe length, washing 
temperature, and salt concentration. In general, longer probes require higher temperatures for 
proper annealing- while shorter probes need lower temperatures. Hybridization generally 
depends on the ability of denatured DNA to reanneal when complementary strands are present 
1 0 in an environment below their melting temperature. The higher the degree of desired homology 
between the probe and hybridizablc sequence, the higher the relative temperature which can be 
used. As a result, it follows that higher relative temperatures would tend to make the reaction 
conditions more stringent, while lower temperatures less so. For additional details and 
explanation of stringency of hybridization reactions, see Ausubel et al.. Current Protocols in 
1 5 Molecular Biology , Wiley Interscience Publishers. (1995). 

"Stringent conditions" or "high stringency conditions", as defined herein, may be 
identified by those that: ( 1 ) employ low ionic strength and high temperature for washing, for 
example 0.015 M sodium chloridc/0.001 5 M sodium citrate/0.1% sodium dodecyl sulfate at 
50 C C; (2) employ during hybridization a denaturing agent, such as formainide, for example, 50% 
20 ( v/v) formamide withO. 1 % bovine serum albumin/0. 1 % Ficoll/0. 1 % polyvinylpyrrolidonc/SOmM 
sodium phosphate buffer at pH 6.5 with 750 mM sodium chloride, 75 mM sodium citrate at 
42"C; or (3) employ 50% formamide, 5 x SSC (0.75 M NaCI, 0.075 M sodium citrate), 50 mM 
sodium phosphate ( P H 6.8). 0.1% sodium pyrophosphate, 5 x Denhardfs solution, sonicated 
salmon sperm DNA (50 pg/ml). 0.1'* SDS, and 10% dextran sulfate at 42 'C, with washes at 
25 42°C in 0.2 x SSC (sodium chloride/sodium citrate) and 50% formamide at 55 C C, followed by 
a high-stringency wash consisting of 0.1 x SSC containing EDTA at 55 C. 

"Moderately stringent conditions" may be identified as described by Sambrook et al.. 
Molecular CJonrn g: A ! aboraiQQ! Manual . New York: Cold Spring Harbor Press. 1989. and 
include the use of washing solution and hybridization conditions (e.g., temperature, ionic strength 
30 and %SDS) less stringent that those described above. An example of moderately stringent 
conditions is overnight incubation al 37 C in a solution comprising: 20% formamide, 5 x SSC 
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( 150 mM NaCl, 15 111M irisodium citrate). 50 mM sodium phosphate (pH 7.6), 5 x Dcnhardl's 
solution, MVA dextran sulfate, and 20 mg/ml denatured sheared salmon sperm DNA. followed 
by washing the filters in I x SSC at about 37-50 C. The skilled artisan will recognize how to 
adjust the temperature, ionic strength, etc. as necessary to accommodate factors such as probe 
length and the like. 

5 The term "epitope tagged" when used herein refers to a chimeric polypeptide comprising 

a SRT polypeptide fused to a "tag polypeptide". The tag polypeptide has enough residues to 
provide an epitope against which an antibody can be made, yet is short enough such that it does 
not interfere with activity of the polypeptide to which it is fused. The tag polypeptide preferably 
also is fairly unique so that the antibody does not substantially cross-react with other epitopes. 

1 0 Suitable tag polypeptides generally have at least six amino acid residues and usually between 
about 8 and 50 amino acid residues (preferably, between about 10 and 20 amino acid residues). 

As used herein, the term "immunoadhesin" designates antibody-like molecules which 
combine the binding specificity of a heterologous protein (an "adhesin") with the effector 
functions of immunoglobulin constant domains. Structurally, the immunoadhesins comprise a 

1 5 fusion of an amino acid sequence with the desired binding specificity which is other than the 
antigen recognition and binding site of an antibody (i.e., is "heterologous"), and an 
immunoglobulin constant domain sequence. The adhesin part of an immunoadhesin molecule 
typically is a contiguous amino acid sequence comprising at least the binding site of a receptor 
or a ligand. The immunoglobulin constant domain sequence in the immunoadhesin may be 

20 obtained from any immunoglobulin, such as IgG-l, IgG-2, IgG-3, or IgG-4 subtypes, IgA 
(including IgA- 1 and IgA-2), IgE, IgD or IgM. 

"Active" or "activity" for the purposes herein refers to form(s) of SRT which retain a 
biological and/or an immunological activity of native or naturally-occurring SRT, wherein 
"biological" activity refers to a biological function (either inhibitory or stimulatory) caused by 

25 a native or naturally-occurring SRT other than the ability to induce the production of an antibody 
against an antigenic epitope possessed by a native or naturally-occurring SRT and an 
"immunological" activity refers to the ability to induce the production of an antibody against an 
antigenic epitope possessed by a native or naturally-occurring SRT. 

The term "antagonist" is used in the broadest sense, and includes any molecule that 

30 partially or fully blocks, inhibits, or neutralizes a biological activity of a native SRT polypeptide 
disclosed herein. In a similar manner, the term "agonist" is used in the broadest sense and 
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includes any molecule that mimics a biological activity of a native SRT polypeptide disclosed 
herein. Suitable agonist or antagonist molecules specifically include agonist or antagonist 
antibodies or antibody fragments, fragments or amino acid sequence variants of native SRT 
polypeptides, peptides, small organic molecules, etc. Methods for identifying agonists or 
antagonists of a SRT polypeptide may comprise contacting a SRT polypeptide with a candidate 
5 agonist or antagonist molecule and measuring a detectable change in one or more biological 
activities normally associated with the SRT polypeptide. 

"Treatment" refers to both therapeutic treatment and prophylactic or preventative 
measures, wherein the object is to prevent or slow down (lessen) the targeted pathologic 
condition or disorder. Those in need of treatment include those already with the disorder as well 

10 as those prone to have the disorder or those in whom the disorder is to be prevented. 

"Chronic" administration refers to administration of the agent(s) in a continuous mode 
as opposed to an acute mode, so as to maintain the initial therapeutic effect (activity) for an 
extended period of time. "Intermittent" administration is treatment that is not consecutively done 
without interruption, but rather is cyclic in nature. 

15 "Mammal" for purposes of treatment refers to any animal classified as a mammal, 

including humans, domestic and farm animals, and zoo, sports, or pet animals, such as dogs, cats, 
cattle, horses, sheep, pigs, goats, rabbits, etc. Preferably, the mammal is human. 

Administration "in combination with" one or more further therapeutic agents includes 
simultaneous (concurrent) and consecutive administration in any order. 

20 "Carriers" as used herein include pharmaeeutically acceptable carriers, excipients, or 

stabilizers which are nontoxic to the cell or mammal being exposed thereto at the dosages and 
concentrations employed. Often the physiologically acceptable carrier is an aqueous pH buffered 
solution. Examples of physiologically acceptable carriers include buffers such as phosphate, 
citrate, and other organic acids; antioxidants including ascorbic acid; low molecular weight (less 

25 than about 10 residues) polypeptide; proteins, such as serum albumin, gelatin, or 
immunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone; amino acids such as 
glycine, glutamine, asparagine, arginine or lysine; monosaccharides, disaccharides, and other 
carbohydrates including glucose, mannose, or dextrins; chelating agents such as EDTA; sugar 
alcohols such as mannitol or sorbitol; salt-forming counterions such as sodium; and/or nonionic 

30 surfactants such as TWEEN ™, polyethylene glycol (PEG), and PLURONICS ™. 

"Antibody fragments" comprise a portion of an intact antibody, preferably the antigen 
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binding or variable region of the intact antibody. Examples of antibody fragments include Fab, 
Fab". F(ab"),, and Fv fragments; diabodies; linear antibodies (Zapata et al.. Protein Hng. 8( 10): 
1057-1062 [ 1995]); single-chain antibody molecules; and multispccific antibodies formed from 
antibody fragments. 

Papain digestion of antibodies produces two identical antigen-binding fragments, called 
5 "Fab" fragments, each with a single antigen-binding site, and a residual "Fc" fragment, a 
designation reflecting the ability to crystallize readily- Pepsin treatment yields an F(ab') : 
fragment that has two antigen-combining sites and is still capable of cross-linking antigen. 

"Fv" is the minimum antibody fragment which contains a complete antigen-recognition 
and -binding site. This region consists of a dimer of one heavy- and one light-chain variable 
1 0 domain in tight, non-covalent association. It is in this configuration that the three CDRs of each 
variable domain interact to define an antigen-binding site on the surface of the VH-VL dimer. 
Collectively, the six CDRs confer antigen-binding specificity to the antibody. However, even 
a single variable domain (or half of an Fv comprising only three CDRs specific for an antigen) 
has the ability to recognize and bind antigen, although at a lower affinity than the entire binding 
15 site. 

The Fab fragment also contains the constant domain of the light chain and the first 
constant domain (CHI ) of the heavy chain. Fab fragments differ from Fab' fragments by the 
addition of a few residues at the carboxy terminus of the heavy chain CH 1 domain including one 
or more cysteines from the antibody hinge region. Fab'-SH is the designation herein for Fab' in 

20 which the cysteine residue(s) of the constant domains bear a free thiol group. F(ab') : antibody 
fragments originally were produced as pairs of Fab' fragments which have hinge cysteines 
between them. Other chemical couplings of antibody fragments are also known. 

The "light chains" of antibodies (immunoglobulins) from any vertebrate species can be 
assigned to one of two clearly distinct types, called kappa and lambda, based on the amino acid 

25 sequences of their constant domains. 

Depending on the amino acid sequence of the constant domain of their heavy chains, 
immunoglobulins can be assigned to different classes. There are five major classes of 
immunoglobulins: IgA, IgD, IgE, IgG, and IgM, and several of these maybe further divided into 
subclasses (isotypes). e.g., IgG 1 , IgG2, IgG3, IgG4, IgA, and IgA2. 

30 "Single-chain Fv" or "sFv" antibody fragments comprise the VH and VL domains of 

antibody, wherein these domains are present in a single polypeptide chain. Preferably, the Fv 
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polypeptide furl her comprises a polypeptide linker between t he VH and VL domains which 
enables the sFv to form the desired structure for antigen binding. For a review of sFv, see 
Pluekthun in The Pharmacology of Monoclonal Antibodies , vol. 1 1 3. Rosenburgand Moore eds.. 
Springer- Vcrlag. New York, pp. 269-315 (1994). 

The term "diabodies" refers to small antibody fragments with two antigen-binding sites, 
5 which fragments comprise a heavy-chain variable domain (VH) connected to a light-chain 
variable domain (VL) in the same polypeptide chain (VH - VL). By using a linker that is too 
short to allow pairing between the two domains on the same chain, the domains are forced to pair 
with the complementary domains of another chain and create two antigen-binding sites. 
Diabodies are described more fully in, for example, EP 404,097; WO 93/1 1161; and Hoi linger 

10 et ah, Proc. Natl. Acad. Sci. USA, 90:6444^6448 ( 1 993). 

An "isolated" antibody is one which has been identified and separated and/or recovered 
from a component of its natural environment. Contaminant components of its natural 
environment are materials which would interfere with diagnostic or therapeutic uses for the 
antibody, and may include en/ymes, hormones, and other proteinaccous or nonproteinaceous 

15 solutes. In preferred embodiments, the antibody will be purified (l) to greater than 95 < c by 
weight of antibody as determined by the Lowry method, and most preferably more than 99 c/ c by 
weight, (2) to a degree sufficient to obtain at least 15 residues of N-lerminal or internal amino 
acid sequence by use of a spinning cup sequenator, or (3 ) to homogeneity by SDS-PAGE under 
reducing or nonreducing conditions using Coomassie blue or, preferably, silver stain. Isolated 

20 antibody includes the antibody in situ within recombinant cells since at least one component of 
the antibody s natural environment will not be present. Ordinarily, however, isolated antibody 
will be prepared by at least one purification step. 

An antibody that "specifically binds to" or is "specific for" a particular polypeptide or an 
epitope on a particular polypeptide is one that binds to that particular polypeptide or epitope on 

25 a particular polypeptide without substantially binding to any other polypeptide or polypeptide 
epitope. 

The word "label" when used herein refers to a detectable compound or composition which 
is conjugated directly or indirectly to the antibody so as to generate a "labeled" antibody. The 
label may be detectable by itself (e.g. radioisotope labels or fluorescent labels) or, in the case of 
30 an enzymatic label, may catalyze chemical alteration of a substrate compound or composition 
which is detectable. 
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By "solid phase" is meant a non-aqueous matrix to which the antibody of the present 
invention can adhere. Examples of solid phases encompassed herein include those formed 
partially or entirely of glass (e.g., controlled pore glass), polysaccharides (e.g., agarose), 
polyacrylamides, polystyrene, polyvinyl alcohol and silicones. In certain embodiments, 
depending on the context, the solid phase can comprise the well of an assay plate; in others it is 
5 a purification column (e.g., an affinity chromatography column). This term also includes a 
discontinuous solid phase of discrete particles, such as those described in U.S. Patent No. 
4,275,149. 

A "liposome" is a small vesicle composed of various types of lipids, phospholipids and/or 
surfactant which is useful for delivery of a drug (such as a SRT polypeptide or antibody thereto) 
1 0 to a mammal. The components of the liposome are commonly arranged in a bi layer formation, 
similar to the lipid arrangement of biological membranes. 

A "small molecule" is defined herein to have a molecular weight below about 500 
Daltons. 

An "oligonucleotide" or "oligomer" is a stretch of nucleotide residues which has a 

15 sufficient number of bases to be used in a polymerase chain reaction (PCR). These sequences 
are based on (or designed from) genomic or cDNA sequences and may be used to amplify, 
confirm, or reveal the presence of an identical, similar or complementary DNA or RNA in a 
particular cell or tissue. Oligonucleotides or oligomers comprise portions of a DNA sequence 
having at least about 10 nucleotides as described above. Oligonucleotides may be chemically 

20 synthesized and may be used as probes. 

"Probes" are nucleic acid sequences of variable length, preferably between about 10 and 
as many as about 6000 nucleotides, depending upon use. They are used in the detection of 
identical, similar or complementary nucleic acid sequences. Longer length probes are usually 
obtained from a natural or recombinant source, are highly specific and are often much slower to 

25 hybridize to a target nucleic acid than are oligomers. Probes may be single- or double-stranded 
and may be carefully designaed to have specificity in PCR, hybridization membrane-based, or 
ELISA-like technologies. 

"Detectably labeled" with regard to a nucleic acid molecule of the present invention 
means that the molecule has attached thereto, cither covalently or non-covalently, a compound 

30 which is detectable such as, for example, radionuclides, enzymes, fluorescent, 
chemi-luminescent, or chromogenic agents. Detectable labels associate with, establish the 
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presence of, and may allow quantification of a particular nucleic or amino acid sequence. 

A "portion" or "fragment" of a polynucleotide or nucleic acid molecule comprises all or 
any part of the nucleotide sequence having fewer nucleotides than about 6 kb, preferably fewer 
than about 1 kb which can be used as a probe. Such probes may be labelled with detectable 
labels using nick translation, Klenow fill-in reaction, PCR or other methods well known in the 
5 art. After pretesting to optimize reaction conditions and to eliminate false positives, nucleic acid 
probes may be used in Southern, Northern or in situ hybridizations to determine whether DNA 
or RNA encoding the protein is present in a biological sample, cell type, tissue, organ or 
organism. 
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Table 1 

/* 

('-(' increased from I 2 to 1 5 
* / is average of HQ 
B is average of Nl) 

match with Mop i> _M: slop-stop = 0: J (joker) match = 0 

7 

ftdefiiH- __M -8 A !: value of a match with a slop 7 

10 int _day|26][26] = { 

5 C D I F G II I J K L M N O P Q R S I V V W X Y Z */ 

{2 0-2. 0. 0.-4. ] .- 1 .- 1 , 0.- 1 .-2.- 1 ()._M. 1 . 0.-2. 1 . 1 . 0. 0.-6, 0.-3. ()}. 
{() 3-4.3.2.-5.0. 1.-2. 0.0.-3.-2. 2... M.- 1. 1.0.0.0.0.-2.-5.0.-3. I}. 
{-2.-4.15.-5.-5.-4.-3.-3.-2. 0.-5.-6.- 5.-4,_\1.-3,-5.-4, 0.-2. 0.-2.-8. 0. 0.-5}. 
15 ID 7 {() 3-5 4. 3.-6. I. I -2.0 0.-4.-3, 2._M.- 1 . 2.- 1 0.0.0.-2.-7.0.-4.2}, 
{ 0 2.-5 5.4.-5.0. I -2.0 0.-3.-2. l._M.-l.2.-l 0.0.0.-2.-7.0.-4.3}, 
{_4.-5.-4.-6.-5. 9.-5.-2. 1 . 0 -5. 2. ().-4._M. -5.-5.-4.-3.-3. 0.-1 . 0. 0. 7.-5}. 
{ 1 0-3 I. 0.-5. 5.-2,-3. 0. 2.-4.-3. 0_.M.-1. -1.-5. 1.0.0.-1.-7. 0.-5.0}. 
{-1.1-3 1. 1.-2.-2. 6.-2. 0. 0.-2.-2. 2_M. 0. 3. 2 - 1 1 . 0.-2,-5. 0. 0. 2}. 
20 / : I */ {-1,-2.-2,-2.-2. 1.-3.-2, 5.0-2, 2. 2,-2._M.-2.-2.-2.- 1 , 0. 0, 4.-5, 0.-1.-2}. 

{ 0 0 0, 0. 0, 0. 0. 0. 0. 0. 0, I). 0. 0._M, 0. 0. 0. 0. 0. 0. 0. 0. 0. 0. ()}. 
{-1,0-5 0.0.-5.-2.0,-2.0.5.-3.0,, l._M.-l. 1.3 0.0.0.-2.-3.0,-4.0}. 
{.2.-3.-6.-4.-3. 2.-4.-:. 2. 0 -3. 6. 4,-3._M.-3.-2.-5,-3.- 1 . 0. 2.-2. 0.-1.-2}. 
{-1 ,-2,-5,-5.-2. 0.-3.-2. 2. 0. 0. 4. 6,-2 _M.-2.- 1 . 0.-2.- 1 . 0. 2.-4, 0.-2.- 1}. 
25 /' N 7 {0 2.-4 2.1.-4.0.2.-2,0 1.-.1-2. 2...M-1. 1.0. 1.0.0.-2.-4.0.-2. I}, 

{ _ M .. M M .... M .. M _M ._.M _ M ... M ._M ._ M ,_M ._ M ._ M . 0._V1._M._M._M- M . . M . ... M ._M ._ VI ._ VI ._ M } . 
{ i .1.-1 -1.-1.-5.-1. 0.-2. 0 -1.-3. -2.-1. _M. ft. 0. 0. 1.0. 0.-1.-6, 0.-5. ()}. 
{ 0 1 -5 2.2.-5.-1. 3-2.0. 1.-2.-1. l._M.0.4. 1.-1.-1.0.-2.-5.0.-4.3}. 
{-2 0 -4 -1,-1 .-4.-3. 2.-2. 0 3.-3. 0, 0 _M. 0. 1.6. ().- 1 . 0.-2. 2. 0.-4, ()}. 
30 / S : / { 1.0 0.0.0.-3. 1.4.-1.0 0.-.V2. I._M 1.-1.0.2. 1.0.-1.-2.0.-3.0}. 

{ 1.0 -2 0.0,-3.0.-1 0.0 0,-1,-1. 0._M. ().-!.- 1. 1. 3.0.0.-5. 0.-3, ()}. 
{ 0 0 0. 0. 0. 0. 0. 0. 0. 0. 0. 0. 0. 0. VI. 0. 0. 0. 0. 0. 0. 0. 0, 0. 0. ()}, 
{ 0 -2.-2.-2,-2.-1.-1.-2. 4. 0.-2. 2. 2 ,-2._M.- 1 .-2.-2.- 1 . 0. 0. 4.-6, 0.-2.-2}. 
{-6 -5 -X -7.-7. 0.-7.-3.-5. ().-3.-2.-4.-4._M.-6.-5. 2.-2.-5. 0.-6. 1 7. 0, 0.-6}. 
35 I- X / { 0 0. 0, 0. ()., 0. 0. 0. 0. 0. 0. O. 0. 0._\1. 0. 0. O. 0. O. O. 0. 0. 0. 0. ()}. 

{-3 -3 0,-4.-4. 7.-5. O.-l. 0.-4.- 1 .-2.-2._M.-5.-4.-4.-3.-3. 0.-2.0. 0. 1 0.-4}. 
{ 0. I -5 2. 3,-5,0. 2.-2. 0. 0,-2.-1. l._M. 0. 3. 0. 0. 0. 0.-2.-6. 0.-4. 4} 

} 
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Table 1 (conD 
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/* 
■ ; v 

#include <sldio.h> 
#includc <ciype.li> 

#dcfine MAX JMP 

#define MAXr.AP 

#define IMPS 

#define MX 



#dcflne- 
#definc 
#define 
#deline 
#dclim* 
#define 



DM AT 
DM IS 
DIN SO 
DINS I 
PINSO 
PINS I 



I (> / ;: max jinnps in a diag */ 

24 / : don't continue lo penalize imps larger than this */ 

1 024 / ;: max jmps in an paih */ 

4 / :: save if there's at least MX- 1 bases since last jmp *l 

3 / : value of matching bases */ 

0 / :: penally for mismatched bases */ 
8 / :: penally for a gap */ 

1 / : penally per base ■ / 
X / :: penally lor a gap */ 

4 / ; penalty per residue */ 



struct jmp { 

short 

20 unsigned short 

}: 



n|MAXJMP|: /* si/e of jmp meg lordely) */ 
x|MAXJMP|; /* base no. of jmp in seq x */ 

limits seq to 2 A I6 -1 */ 
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struct diag { 

int score: 

long oft set: 

short ijmp; 

struct jmp jp: 

}: 



struct path { 
int 

short 
int 



char 

char 

char 

char 

int 

int 

int 

int 

int 

int 

int 

int 

int 

long 

struct 

struct 



diag 
path 



spc; 

n|JMPS|: 
x(JMPS|: 



*ofile; 

*namex[2|; 

*prog: 

*seqx[21: 

dmax; 

dmaxO: 

dna: 

endgaps: 
gap\. gapy: 
len(). lenl: 
ngapx. ngapy: 
smax; 
*xbm: 
offset; 
*dx; 
PPI2I: 



/* score at last jmp */ 
/ offset of prev block */ 
/* current jmp index */ 
/* list of jmps */ 



/• number of leading spaces */ 

/" si/e of jmp (gap) */ 

A loc of jmp (last clem before gap) */ 



/ :: output file name */ 

/■'■ seq names: getseqsO 7 

/'-'■ prog name for err msgs */ 

/ :: seqs: getseqst ) */ 

/ :: best diag: nw( ) */ 

/ :: final diag */ 

/'■'• set if dna: main( ) */ 

/ ;; set if penalizing end gaps */ 

/ ;: total gaps in seqs */ 

/ :: seq lens ■ / 

I'' total size of gaps */ 

/ :: max score: nw( ) */ 

/ ;; bitmap for matching */ 

current offset in jmp file */ 
/ :: holds diagonals */ 
/ :: holds path for seqs */ 
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char 
char 



*calloc( ). *malloc< >. 'index! ). strcjiyt ); 
■ : gelseq( i. *g_ealloc( ): 
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Table 1 (conf) 

/ : ' Needleman-Wunseh alignment program 

* usage: [irons filel file2 

where fill? I and file2 arc two dna or two protein sequences. 

The sequences can be in upper- or lower-case an may contain ambiguity 

Any lines beginning with >' or '<' arc ignored 

Max file length is 6553? i limited by unsigned short x in the jmp struct ) 

A sequence with 1/3 or more of its elements ACGT13 is assumed to be DNA 
5 Output is in the file "align. out" 



The program ma\ create a imp file in Amp to hold into about traceback. 
■ : Original version developed under BSD 4.3 on a vax S650 

■■-/ 

#inrlude "nw.h" 
1 5 #include "day .li" 

static dbval|26] = { 

I . I 4.2. 1 3.0.0.4. 1 1 .0.0. 1 2.0 3. 1 5.0.0.0.5.0.X.K.7.9.( 1. 1 0.0 



static pbva!|20l={ 

I. 2|( l«CD*-'A*»|( l«('\"-*A')i. 4. 8. Id, 32. 64. 
128. 256. OxFFFFFFF. l«l(). 1«11. 2. I«:13. 1«I4. 
I«:«:15. 1«16. !«17, 1«:1S. I«:19. l<-:20. I«:21. 1«22. 
25 l«23. 1«24. 1«25|(1<-:CF:"-*A*))|( \«CQ'-'A')) 



main(ae. av > main 
int ac; 

: av[|: 



30 char 



piog = av[()]: 
if(ac '=3){ 

fprintf(stden\"us.ige: C A s filel tiie2\n". prog); 
35 fprintftstderr." where filel and file2 are two dna or two protein sequences. \n" ); 

fprintffstderr.'The sequences can be in upper- or lower-casc\rf): 

fprinif{stderr."Any lines beginning with V or '<:' are ignored\n">; 

fprintflstderr. "Output is in the file \"align.out\"\n">: 

exiU 1 ): 

40 } 

namex[0] = av| 1 1: 
namex| I ) = av[2 1: 
seqxfO] = getseq(namex[()| &len()): 
scqx| 1 ) = getseq(namex| 1 ] &lenl ): 
45 xbm = (dna)? _dbval : _pbval: 

endgaps -0; /* 1 to penalize endgaps */ 

ofile = "align. out": /* output file 7 

50 nw< ): /* fill in the matrix, gel the possible jmps */ 

readjmpst ); /* get the actual jmps 7 

printO: /* print stats, alignment */ 



55 



cleanup(O): /* unlink any tinp files */ 
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Table 1 (contM 

/* do the alignment, return besi score: main( > 
- dna: values in Pitch and Smith. PNAS. SO. 1382-1386. I 
* pro: PA M 250 values 

■ : When scores are equal, we prefer mismatches lo any gap. prefer 
5 :]: a new gap lo extending an ongoing gap. and prefer a gap in seqx 

■ : to a gap in seq y. 

*/ 

nw< ) 
{ 



char 


*px. *py: 


/ ' seqs and ptrs */ 


int 


* ndely. *dely: 


/ - : keep track of dely */ 


int 


ndelx, delx: 


/ ' keep track of delx */ 


int 


*tmp: 


/ for sw apping row 0. row 1 / 


int 


mis; 


/ score for each type */ 


int 


insO. ins] : 


/ :: insertion penalties */ 


register 


id: 


/'■■'■ diagonal index */ 


register 


ij: 


/'■- jmp index */ 


register 


*eol(). *eol 1 : 


!■■■'■ score for enrr, last row 


register 


xx. yy: 


/ ■■ index into seqs */ 



20 

dx = (struct diag *)g_calloe("lo gel diags". lenO+len 1 + 1 . sizeoft struct diag)); 



ndely = (int *)g_eal]oc( M to get ndely". len 1 + 1 . si/eolunt )): 
dely = (int *)g_calloe("lo get dely". len I + 1 . sizeofint ) ): 
25 eolO = (int * )g_ealloc( "to get colO". len 1 + 1 . sizeoftint)); 

coll = (int *)g_ealloe("to gel coll", len 1 + 1 . sizeof(int)): 
insO = (dna)? DINSO : PINSO; 
ins! = (dna)? D1NS1 : PINS I ; 

30 s max = - 1 0000: 

if (endgaps) { 

for (eol0[()| = dely [01 = -insO. yy = 1 : yy <= lenl : yy++){ 
eol0[yy] = delylyy] = colO[yy-l | - ins 1 : 
ndelylyy! = yy; 

35 } 

col()[0| = 0: /* Waterman Bull Math Biol 84 7 

} 

else 

for ( yy = 1 : yy <= len I : > > ++) 
40 delylyy I = -insO: 

/* fill in match matrix 

*/ 

for (px = seqx[0|. xx = 1: xx <= lend: px++. xx++}{ 
45 /* initialize first entry in col 

*/ 

if (endgaps) { 

if (xx = I ) 

col 110] = delx = -finsO+insI ): 

50 else 

coll|()| =delx = co!0|0| - insl; 
ndelx = xx: 

} 

else { 

55 eoll[()| = (): 

delx = -insO: 
ndelx = 0; 
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Table 1 (contM 

...nw 

for (py = seq\| 1 |. yy = 1 ; yy <;= I en I : p> ++. >>++){ 
mis = colOJyy- ! |; 
if (Una) 

5 mis += (xhmr^px-'A'l&xhm^py-'A'l)'.' DMA I : DMIS; 

else 

mis += _da\ [ p\ " A" |[ p\ ' A' |; 

/ : update penally fur del in x seq: 
10 :: favor new del over ongong del 

: ignore MAXGAP if weighting endgaps 
I 

if (endgaps || ndely[yy] < MAXGAP) { 

if ( col( )| y y | - in sO >= do I y [ y > ] ) { 
15 delylyy] = eol()| yy| - (ins()+insl ): 

ndely|vv] = 1; 

} else { 

dely[yy| -= ins I : 
ndelvl w|++; 

20 > " 

} else { 

if <col()[yy] - <ins()+insl > >= dely(yy ] ) { 

dely[yy| = eol()[yy] - (ins()+insl ): 
ndelvl vv] = 1 ; 

25 } else 



} 



ndely|yy)++: 



/ :: update penalty for del in y seq; 
30 favor new del over ongong del 

■I 

if (endgaps || ndelx < MAXGAP) { 

if <col 1 [>\- 1 | - ins() >= delx) { 

delx =eoll[yy-l] - (ins()+insl ); 
35 ndelx = 1: 

} else { 

delx -= ins I : 
ndelx++: 

} 

40 } else { 

if (col 1 1 >*> - 1 1 - (insO+insI ) >= delx) { 

delx = col 1 1 yy- 1 ] - (insO+insl ): 
ndelx = I: 

} else 

45 ndelx ++: 

} 

/ ;: pick the maximum score; we're favoring 
: mis over any del and delx over delv 

50 */ 
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...nw 

id = w - y\ + Ion I - I : 
if (mis >= dclx mis >= dely|y> |.) 
col 1 1 y v] = mis; 

5 else if » dclx >= dely|yy|) { 

col I j yy | = dclx; 
ij = dx| id|.ijnip: 

if ulx|id]Jp.n[()| <S:& (!dna || (ndclx >= MAXJMP 
&cv: xx > dx[id].jp.x|ijl+MX) || mis > dx[id |.scoiv+DINSO) ) { 
10 dx!id].ijmp++: 

if (++i/:>= MAXJMP) { 
vvritejmpsi id ); 
ij = dx[id l-ijmp -= 0: 
dx]id|.otfsel = offset: 

1 5 oi l set += si/eofi struct imp) + sizeoft offset ); 

} 

I 

dx|id].jp.n|ij | = ndclx; 
dx|id].jp.x|ij] = xx: 

20 dx|id].score = del\: 

} 

else { 

colllyyl = dcly[y> |: 
ij = dx[iJ|.ijnip: 

25 if (dx|idl,ip.n|0] (!dnu || (ndelyiyyl >= MAXJMP 

xx > dxlidl.jp x[ij|+MX) || mis> dx| id].score+IMNS<))) { 
dx|id].ijmp++: 
if (++ij :>= MAXJMP) { 
wrilejmps(id): 

30 ij = dx[ id j.ijmp - 0; 

dx| id |. offset = offset: 

offset += sizenft struct jmp) + sizcoft offset): 

} 

} 

35 dx[id].jp.n[ij] = -ndelylyy]: 

dx|id|.jp.x[ij] - x> : 
dx | id]. score = dcK |vv]: 

} 

if (xx == lcn()&& yy <lenl){ 
40 /* last col 

*/ 

if (endcaps) 

col I (yy | -= insO+ins I *< len I -yy ): 
if (col I ]yy] > smax ) { 
45 smax = col I [yy]: 

dmax = id: 

} 

} 

) 

50 if (endgaps xx < lenO) 

col 1 1 yy- 1 I -= insO+insl ■ (lcn()-xx); 
if (col I [yy- 1 ( > smax) { 

smax = col l [yy- 1 |: 
dmax = id; 

55 } 

imp = colt): col() = col l ; col l = imp: 

} 

(void) IreeUchar *)ndely): 
(void) lree((char *)dcly): 
60 (void) frce((char *)col(»: 
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( void) freed char : ' leol I ): } 
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Table 1 (conD 



prinK ) -- onl\ routine visible outside ihis module 



; ' getmaU ) -- trace back best path, count matches: prinu ) 

:: pr alignf) -- print alignment of described in array p||: prinK) 

* dumpblockt ) dump a block of lines with numbers, stars: pr_align< ) 
: numst ) -- put out a number line: dumpblock( ) 

10 : - putlineO -- put out a line (name. [numj. seq. [num|): dumpblocko 

* stars( ) - -put a line ol' stars: dumpblnckf ) 
stripnamet ) -- strip any path and prefix from a seqname 

*/ 



15 



#include "nw.h" 



20 



25 



#define SPC 3 
#define P_LINI£ 256 
#define P_SPC 3 

extern _day[26)|26l.: 
int olen; 
KILF fx: 



print O 
{ 



maximum output line : 7 
/■■ space between name or tuini and seq 7 



/* set output line length : 7 
/* output file */ 



int 



Ix. ly. first gap. lastgap; 



/- overlap */ 



print 



30 



35 



40 



45 



50 



55 



if (tlx = fopentofile, "w")) ==()}{ 

t print ft stderr."^ s: can't write l A s\n". prog, ofile): 
eleanupt I ): 

} 

(print ft fx. "<f'irst sequence: '/is (length = '/fd)\n'\ namex[()]. lent)): 
(print ft fx, "<second sequence: ( /< s (length = r /fd)\n". namex| l [. lenl ): 
olen = 60: 
Ix = lent): 
ly = lenl: 

lirstgap = lastgap = 0; 

if (dmax < lenl - 1 ) { / x leading gap in x */ 

pp|0|.spe = firstgap = len 1 - dmax - 1 ; 
Iv -= pp[0].spc; 

} 

else if (dmax > lenl - 1 ) { /* leading gap in y */ 
pp( 1 ].spe = lirstgap = dmax - (lenl - 1 ); 
Ix -= pp| 1 |.spc: 

} 

if (dmaxO < lenO - 1 ) { /' ;: trailing gap in x */ 

lastgap = lent) - dmaxO - 1 : 
lx -= lastgap; 

} 

else if (dmaxO > lent) - 1 ) { /* trailing gap in y */ 
lastgap = dmaxO - (lend - 1 >: 
ly -= lastgap: 

} 

getmatdx. ly. lirstgap. lastgap): 
pr_align( ); 
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; ' : trace back the best path, count matches 

*/ 

static 

get mall lx. I\. t'iistgap. lastgap) 
int l\. ly: 

inl lirslgap. lastgap: 



i 



/ "core" (minus endgaps) */ 
/■■■ lead i ng trailing overlap */ 



int 

char 

double 

register 

register char 



mil. iO. i I . si/0, si/ ! : 

outx|32|: 

pet: 

n(). nl; 

*p(). *pl: 



/ ■■ yet total matches, score 

*/ 

iO=il = si/.0= si/1 =0: 
p() = seqx[0| + pp| 1 j.spc: 
pi = seq\| I ] + pp|()|.spe: 
n() = pp| 1 j.spc + I : 
ii ] = pp|()].spc + I : 



nm = 0: 

while { *p() *pl ) { 
if (si/0) { 

pi ++; 
n 1 ++: 
si/0--: 

} 

else if (si/1 ) { 

p()++: 
n()++; 

si/1-: 

} 

else { 

if (xhni|*p()-"A*j&xhni|*p]-'A'|) 

nni++: 
if (n()++ == pp|0|.x[iO]) 

si/.0 = pp[0|.n|iO++|: 
if (nl++ == pp| 1 |.x|il 1) 

si/ 1 = pp| 1 ].n|i 1 ++|: 

p()++; 

p 1 + -K 

} 

} 

/ ;: pet homology: 

li: it" penalizing endgaps. hase is the shorter seq 
else, knock off overhangs and take shorter core 

*l 

if (endgaps) 

lx = (len()< lenll? lenO : lenl; 

else 

lx = (lx < ly)7 lx : ly: 
pet = 100. (double)nm/(doul)le)l\: 
tprinlKlx. "\n"): 

t'printfifx. "< ( /iii matches in an overlap of f A&. ',V.2f percent similarity^", 
nm. (nm == 1 )'.' "" : "es". lx. pet): 
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Table 1 (conf) 



fpnnltdx. "<gaps in first sequence: ' <<\". gapx): 
il'(gapx) { 

(void) sprintHoutx. " f<d '< s'ys)". 

ngapx. (dna)7 "hasc":"residue". (ngapx == I )'.' "":"s"): 
I print t( fx."' i s". oui x ): 



.getmat 



10 



15 



20 



25 



30 



35 



40 



fprinlft fx. ". gaps in second sequence: <; <d". gapy): 
if (gapy) { 

(void) sprint f( nut x. " (97 d ' < s ( '< s)". 

ngapy. Ulna)'.' "base":"residue". (ngapy == ! )'.' "":"s"): 
tprintf(fx,"9f s". ouix); 



} 

if <dna> 



else 



fprintfdx, 

"\n<seore: 9vd (match = 9rd. mismatch = v /< d. gap penalty - f Hi + ',/d per base An" 
smax. DM AT. DMIS. DINSO. DINSl ): 



I print ft fx. 

"\n<score: C A d (Dayhoff PAM 250 matrix, gap penalty = ( ul + 9*d per residue An" 
smax. PINSO. PINSI): 
if (endgaps) 

t'pri ntf( fx. 

"<endgaps penalized, left endgap: ( >i d </i s l }< s. right endgap: 9fd ( < s< < s\n '. 
first yap. (dna)7 "base" : "residue", (first gap == I )7 "" : "s". 
lastgap. (dna)7 "base " : "residue", (laslgap -= I )7 "" : "s"); 



else 



fprintfdx. "<endgaps not penal i/ed\n"): 



static 


nm: 


/■ 


matches in core -- for checking */ 


static 


I max; 


/■ 


lengths of stripped file names */ 


static 


>j[2|: 


/■■ 


jmp index for a path */ 


static- 


nc|2|: 


I- 


number at start of current line */ 


static 


ni[2|: 


I- 


current elem number - for gapping */ 


static 


si/[2]: 






static char 


*ps[2]: 


/■ 


ptr to current element */ 


static char 


*po|2]: 


/■ 


ptr to next output char slot */ 


static char 


out[2]|P_LINE]; 


1- 


output line */ 


static char 


siar|P_LINFi|: 


/- 


set by starsi ) */ 



45 



50 



/* 

* print alignment of described in struct path pp|] 

*/ 

static 

pr_a!ign() 
{ 



int 
int 

register 



nn: 

more; 

i: 



/ ' char count */ 



pr_align 



55 



for (i = (). [max = 0; i < 2; i++) { 

nn = stripname(namex| i | ): 
if (nn > I max) 

Imax = nn; 



nc|i|= I; 
ni|i] = I; 
si/|i] = ij[il = 0: 
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ps|i| = sci|\[i]: 

po|i| = oui|i|: } 



71 



10 



Table 1 (conf) 

for (mi = nm = 0. more = i : more: ) { ...praligll 

lor li = more = 0: i < 2; i++i { 
/• 

do we ha\e more of this sequence'.' 

: 7 

if i !*ps|i|) 

continue: 

more++: 



if (pp[i].spc) { /* leading spaee */ 
*po|i|++ = ' 
pp|i|.spe-: 

15 } 

else if (si/Ji |) { /* in a gap */ 
* P o[il++ = 
si/[i|-: 

} 

20 else { / ; we're putting a seq element 

•7 

*po|i| = *ps[i]: 
if (islowert *ps| i | )) 

*ps|i] = toupper( :i: ps| i |): 

25 po|i]++: 

ps|i|++; 

/* 

- f are we at next eap tor (his seq? 

30 */ 

if ( ni|i] == pp[i|.x|ij[i]]) { 

/* 

* we need to merge all gaps 

* at this location 

35 v 

siz[i| = pp|i|.n[ij[il++|: 
while (ni|ij == pp[i].x(ij[i]]) 

si/[i] += pp|i].n[ij[i|++|: 

} 

40 ni|i]++: 

} 

} 

if (++nn == olen || !more mi) { 
dumphlockt ): 

45 for (i = 0; i < 2: i++) 

po|i| = oui[i]: 

nn = 0: 

} 

} 

50 } 

/* 

* dump a block of lines, including numbers, stars: pr_align( ) 
*/ 

55 static 

dumpbiocko dumpblock 

{ 

register i: 



72 




73 



Table 1 (conV) 



10 



15 



( void ) pukM "\n\ fx); 
for ti = 0: i < 2: i++) { 

if <*out[i| (*oiu|i) != * ' || (po[ii! != * ' $ 
if ti ™ 0) 

minis* i ): 
if <i = () *oul| ) |t 
stars! ); 

pullinc(i): 

if ti == o *out| l|) 

fprintftfx. star); 

if (i == l) 

nums(i ); 



} 



.dumpblock 



} 



20 



25 



30 



35 



40 



45 



/* 

■■■ pm out a number line: dumpbiockt > 

*/ 

static 



nums(ix ) 



int ix; I* index in out|J holding seq line */ 

char nlinclPJJNlil: 

register i.j: 

register char * pn. *px. *py; 

fortpn = nline. i = 0: i < lmax+P_SPC: pn++) 

*pn 

for ( i = nc|ix|. py = out[ix): *py: py++. pn++) { 
" " || *py ==•-') 
*pn = ' 



if (*py == 
else { 



if (\</< io== 0|| (i == 1 && nc[ix] != 1 )) { 

j = (j < ())7 -j ; i; 

for (px = pn; j; j /= 10. px--) 
*px =y* l()+"()"; 

if (i < 0) 

*px = '-'; 

} 

else 

* pn = ' ' ; 



nums 



} 



50 



*pn = '\0': 
nefix] = i; 

for (pn = nline; *pn; pn++) 

(void) pnte(*pn, fx): 
(void) putc(*\n\ fx); 



55 / 

* put out a line (name, [num|, seq. |num|): dumpbiockt) 

*/ 

static 

putlinctix ) 



putline 
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.putline 



10 



15 



i ii t i: 
register char p\: 

for (px = namex|ix|. i = 0: px A: <t "px 
( void ) putct *p.\. fx ): 

for ( : i < Imax+P. SPC; i++) 
(void) putct ' '.fx): 

/■■- these count from I : 

* ni[ | is current element (from I ) 

* nc[] is number at start of current line 

*/ 

fortpx =out|ix]: px: p\++) 

(void) pulc(*px&0x7F. fx): 
(void) putct "\n". fx): 



= ':": px++. i+-H 



20 



25 



30 



35 



40 



45 



50 



/* 

* put a line of stars (seqs always in oui[()J. out| I ]): dumphlockO 

*/ 

static 



slarsO 
{ 



int 

register char 



*p(). "p l . ex. *px; 



if (!*oul|()| || f*out|()| == " && *<po[0]) == ' •) || 
!*om| I | || (*out| !]=='•&& *(po| ]]>==■•)) 
return: 

px = star; 

for (i = lmax+P_SPC: i: i-) 

:: p\++ = ' "; 

for (p() = out[()]. pi =out|l]: *p() && *pl: p()++, p\++){ 
if (isa1pha(*p«) isalphat *p ! ) ) { 

if <xbm|*pO-"A*|&xhm[*pl-'A'R 
ex = 
nni++; 

} 

else if (!dna && _day[*p()-' A* |[*pl -* A' ) > 0) 

cx = Y: 

else 

ex = ' *; 

} 

else 

cx = " ': 
*px++ - cx; 



stars 
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*px++ = *\n'; 
-px = "NO": 
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Table 1 (conf) 


> 




/ 

* strip path or prefix from pn. return len: pr..alivim ) 

*/ 




5 


static 

slripnamet pn) 

char 'pn: /* file name (may be path) ■'■/ 

{ 

register char p\. *py: 


stripname 


10 
15 


py = 0: 

for (px = pn: ' p\: px++) 
if (*px == 7") 

py = px + 1 : 

if (py) 

(void) strcpytpn. py); 
relurn(slrlen(pn)); 

} 




20 






25 






30 






35 






40 






45 






50 
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60 


78 






79 



Table 1 (conD 



10 



15 



20 



25 



;: cleanups -- cleanup any imp file 

* getseql ) -- read in seq. set dna. ten. mavlen 

g calloci ) -- calloci ) with error chock in 

icadjmpsO -- get t ho good jmps. from imp file if necessary 
■ wiitejmpsi ) -- write a filled array of jmps to a imp file: nw< ) 
-7 

#include "nw.h" 
#include <:sys/file.h> 



char 
I IU- 

int 
long 



"jname = 'Vtmp/homgXXXXXX": 



clean u p( ): 
I seek! ): 



/» 

* remove any imp file if we blow 

*/ 

cleunup(i) 

int i 

{ 

if (fj) 
exit(i); 



( void) unlinkl jname): 



/* tmp file for jmps 7 
/* cleanup imp file */ 



cleanup 



30 



35 



40 



45 



50 



55 



/* 

;i: read, return ptr to seq. set dna. len. maxlen 

* skip lines starling with '<". or >* 

* seq in upper or lower case 

*/ 

char 

getseqf file, len ) 

char *l"ile: /* tile name */ 
int Men: /* seq len */ 



{ 



char 

register char 
int 

FILE 



line 1 1024]. *pseq: 
*px. *py: 
natge. lien: 



getseq 



if ({fp = fopcn(file."r")) ==()){ 

fprinlf(stderr."9<f s: can't read ( < s\n". prog, file); 
exi(( 1 ): 

} 

tlen = nalge = 0: 

while (lgels( line, 1024. fp)) { 

if (/Mine == *:* j| *linc == '<' || *line == *>") 

continue: 
for (px = line; *px != '\n"; px++) 

if (isuppcr(*px) || islowcr(*px)) 
llen++: 

} 

if ((pseq = malloc<(unsigni*d)(tlen+f>))) ==()){ 

fprintfi :sklerr." r . 4 s: mallocO failed to get V A d bytes for 'A s\n". prog, tlen+d. file): 
e.xit( 1 ); 

} 

pseq[0| = pseql I | = pseq|2| = pseq|3j = '\0': 
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10 



15 



20 



25 



30 



} 



py = pseq + 4; 
,: len = lien: 
rewinddpK 

while ( Igetsi line. I024. fpn { 

if (* line == ':' || Mine == '<* || Mine == ">") 

continue: 
for tpx = line; *px != An": px++)| 
if (isupper( *px)) 

*py++ = *px; 
else if (islower(*px)) 

*P>'++ = louppen '■■ px ); 
if (indcx("ATCiC'U".*<py-l ))) 
naige++; 

} 

} 

*pv++ = '\i)': 
*py = AO": 
(void) felose(fp): 
dna = naigc > lilcn/3 ); 
return(pseq+4): 



char * 

g_ealloc( nisii. nx. s/ ) 

char *msg; 
int nx. s/: 



/* program, calling routine ; 7 

/ i: number and si/e of elements */ 



{ 



.getseq 



gcalloc 



char 



*px. *calloc< ): 



35 



40 



if ((px = callocU unsigned )nx. ( unsigned Is/)) ==()){ 
if (*msg) { 

rprintftstderr. "';vs: g_calloe() failed 9? s (n=9rd. s/.=^d)\n". prog. msg. nx. s/): 
exit( I ); 

} 

} 

return(px); 
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* get final jmps from dx| ] or tmp file, set pp| |. reset dmax: main( ) 

•/ 

readjmps( ) 
{ 

int fd= I; 

int si/. i(), i I ; 

register i. j. xx: 



read jmps 



if <fj) I 

(void) fclose(fj): 

if ((I'd = opeiKjname. <D_RDONLY. ()))< 0) { 

fprintf(stderr. "'"< s: ean'i npciK ) ( < s\n". prog, j name): 
eleanup( I ): 

} 

} 

for (i = i() = i 1 = 0. dmaxO = dmax. x\ = lent): : i++) { 
while ( 1 ) { 

for (j = dx[dmax|.ijmp: j >=()<£& dx[dmax |.jp.x|j| >= xx; j--) 
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Table 1 (conD 

...readjmps 

if (j < 0 dx| dmax |. offset <x:<\: I j I { 

(void) Iseekdd. dx[ dmax]. offset. <>): 

(void) readi t"d. (char * )<S:dx|dmax |Jp. sizeoff struct jinp)): 

( void I read< I'd. (char * ><fidx | dmax |. offset. sizeof(dx|dmax | .offset H: 

dx|dmax|.ijmp = MAXJMP-1: 



} 

else 



break: 



if (i :-= JMPS) { 

I print f( stderr, "Cf s: too many gaps in alignmeniNn". prog); 
cleanupt 1 ): 

} 

15 if (j :-=()){ 

si/. = dx|dmax].jp.n|j|: 
xx = dx[dmax|.jp.xlj |: 
dmax += si/.: 

if {si/ < 0) { /* gap in second seq */ 

20 pp[l].n[il] = -si/.; 

xx += si/; 

/* id = xx - yy + Icnl - I 

*/ 

pp[ 1 ' 1 = xx - dmax + Icn I - 1 ; 
25 gapy++: 

ngapy -= si/: 
/* ignore MAXGAP when doing endgaps */ 

si/. = (-si/ < MAXGAP || endgaps)'/ -si/ : MAXGAP: 
il++: 

30 } 

else if (si/ > 0) { /* gap in first seq */ 
pp|()|.n|i()| = si/; 
pp|0|.x[iO| = xx; 
gapx++; 
ngapx += si/.: 
/* ignore MAXGAP when doing endgaps */ 

si/ = (si/ < MAXGAP || endgaps)? si/ : MAXGAP: 
i<)++; 

} 

40 } 

else 

break: 

} 

45 /* reverse the order ot'jmps 

*/ 

for (j = 0, i()--:j < i(): j++. i0-) { 

i = ppior.nlj]: p P [0|.n[j| = pp[()].n[i()|: pp[0].n[i()| = i: 
i = pp[()l.x[j]; pp[()|.x[j| = pp|0i.x|i()|: pp[0].x|i()| = i: 

50 } 

for (j =0. i 1 — - f < il:j++. i 1 — > { 

i = ppl I | n| j ]: pp| 1 |.n| j | = pp[ 1 |.n|i 1 1: pp[ 1 ].n[i I | = i; 
i = PPl 1 1-xUl: p P [ I |.x|.il = pp| 1 |.x|i 1 1: pp| 1 ].x|i 1 1 = i: 

} 

55 if(fd>=0) 

(void) close) Id): 

if(lj) { 

(void) unlink(jnamc); 
fj = 0: 

60 ofl'sel = 0: 
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• • 

Table 1 (conf) 






/ '* 

:1 write a Tilled jmp struct offset of the prcv one (it am ): nvv( ) 




5 


/ 

writejmpsnx) 

inl ix; 

{ 

char *mklcnipi ): 


write j mps 


10 
15 

20 


if (!tj) { 

if (mktemp(jnamc) <()) { 

t'prinlt'tstderr. " c /ty. can't mktempO ( ^s\n", prog, jname): 
cleanup* 1 ): 

} 

if ((fj = lopeiK jnamc, "w")) ==()){ 

fprintflstdcrr. "9is: can't write 9<s\n". proji. jname): 
cxit( 1 ): 

} 

} 

(void) fwrite((char *)&dx|ix|.jp, si/eof( struct jmp). 1. fj): 
(void) fwrileUchar *)&dx|ix].oft"sct. si/.eof(dx|ix|.olTsci). 1. Ij); 

I 
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Table 2 



PRO X X X X X X X X X X X X X X X (Length = 1 5 amino acids) 

Comparison Protein XXXXXVYYYYYY (Length = 1 2 amino acids) 

c /< amino acid sequence identity = 

(the number of identically matching amino acid residues between the two polypeptide sequences as determined by 
ALIGN-2) divided by (the total number of amino acid residues of the PRO polypeptide) = 

5 divided by 15 = 33.39f- 
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Table 3 



PRO XXXXXXXXXX {Length = 10 amino acids) 

Comparison Protein XXXXXYYYYYYZ/.YZ {Length = IS amino acids) 

( /i amino acid sequence identity = 

(the number of identically matching amino acid residues between the two polypeptide sequences as determined by 
ALICjN-2) divided by {the total number of amino acid residues of the PRO polypeptide) = 

5 divided by 10 = 50% 
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Table 4 



PRO-UNA NNNNNNNNNNNNNN (Length = 14 nucleotides) 

Comparison DNA NNNNNNLI LLLLLLLL (Length = 16 nucleotides) 

( /( nucleic acid sequence identity = 

(the number of identically matching nucleotides between the two nucleic acid sequences as determined by ALIGN- 
2) divided by (the total number of nucleotides of the PRO-DNA nucleic acid sequence) = 

6 divided by 14 = 42.9', 
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Table 5 



PRO-DNA NNNNNNNNNNNN (Length = 1 2 nucleotides) 

Comparison DNA NNNNLLLVV (Length = 9 nucleotides) 

( A nucleic acid sequence identity = 

(the number of identically matching nucleotides between the two nucleic acid sequences as determined by ALIGN- 
2) divided by (the total number of nucleotides of the PRO-DNA nucleic acid sequence) = 

4 divided by 12 = 33.39? 
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II. 



Compositions and Methods of the Invention 



A. Full-length SRT Polypeptides 



The present invention provides newly identified and isolated polynucleotide sequences 
encoding at least a portion of full-length human polypeptides referred to in the present 
application as SRT polypeptides. In particular, cDNAs encoding at least a portion of SRT 
5 polypeptides have been identified and isolated, as disclosed in further detail in the Examples 
below. For sake of simplicity, in the present specification the polypeptides encoded by nucleic 
acid molecules disclosed herein as well as all further native homologues and variants included 
in the foregoing definition of SRT, will be referred to as "SRT", regardless of their origin or 
mode ol preparation. 



B. SRT Polypeptide Variants 
In addition to the native sequence SRT polypeptides described herein, it is contemplated 
that SRT variants can be prepared. SRT variants can be prepared by introducing appropriate 
nucleotide changes into the SRT DNA, and/or by synthesis of the desired SRT polypeptide. 
15 Those skilled in the art will appreciate that amino acid changes may alter post-translational 
processes of the SRT, such as changing the number or position of glycosylation sites or altering 
the membrane anchoring characteristics. 

Variations in the native sequence SRT or in various domains of the SRT described herein, 
can be made, for example, using any of the techniques and guidelines for conservative and non- 
20 conservative mutations set forth, for instance, in U.S. Patent No. 5,364,934. Variations may be 
a substitution, deletion or insertion of one or more codons encoding the SRT that results in a 
change in the amino acid sequence of the SRT as compared with the native sequence SRT. 
Optionally the variation is by substitution of at least one amino acid with any other amino acid 
in one or more of the domains of the SRT. Guidance in determining which amino acid residue 
25 may be inserted, substituted or deleted without adversely affecting the desired activity may be 
found by comparing the sequence of the SRT with that of homologous known protein molecules 
and minimizing the number of amino acid sequence changes made in regions of high homology. 
Amino acid substitutions can be the result of replacing one amino acid with another amino acid 
having similar structural and/or chemical properties, such as the replacement of a leucine with 
30 a serine, i.e., conservative amino acid replacements. Insertions or deletions may optionally be 
in the range of about 1 to 5 amino acids. The variation allowed may be determined by 
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systematically making insertions, deletions or substitutions of amino acids in the sequence and 
testing the resulting variants for activity exhibited by the full-length or mature native sequence. 

SRT polypeptide fragments are provided herein. Such fragments may be truncated at the 
N-terminus or C-terminus. or may lack internal residues, for example, when compared with a 
full-length native protein. Certain fragments lack amino acid residues that are not essential for 
5 a desired biological activity of the SRT polypeptide. 

SRT fragments may be prepared by any of a number of conventional techniques. Desired 
peptide fragments may be chemically synthesized. An alternative approach involves generating 
SRT fragments by enzymatic digestion, e.g., by treating the protein with an enzyme known to 
cleave proteins at sites defined by particular amino acid residues, or by digesting the DNA with 

10 suitable restriction enzymes and isolating the desired fragment. Yet another suitable technique 
involves isolating and amplifying a DNA fragment encoding a desired polypeptide fragment, by 
polymerase chain reaction (PCR). Oligonucleotides that define the desired termini of the DNA 
fragment are employed at the 5' and 3' primers in the PCR. Preferably. SRT polypeptide 
fragments share at least one biological and/or immunological activity with the corresponding 

15 native SRT polypeptide. 

In particular embodiments, conservative substitutions of interest arc shown in Table 6 
under the heading of preferred substitutions. If such substitutions result in a change in biological 
activity, then more substantial changes, denominated exemplary substitutions in Table 6, or as 
further described below in reference to amino acid classes, are introduced and the products 

20 screened. 

Table 6 



25 



Original 
Residue 



Exemplary 

Substitutions 



Preferred 

Substitutions 



30 
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Ala (A) 
Arg (R) 
Asn (N) 
Asp(D) 
Cys (C) 
Gin (Q) 
Glu (E) 
Gly(G) 
His (H) 
He (I) 



val; leu; ile 
lys; gin; asn 

gin; his; lys; arg 

glu 

ser 
asn 
asp 

pro; ala 

asn; gin; lys; arg 
leu; val; met; ala; phe; 
norleucine 



glu 



val 
lys 
gin 

ser 
asn 
asp 



ala 



leu 
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Leu (L) 


norlcucinc; ilc; val; 
met; ala; phe 


ile 




Lys(K) 


arg; gin; asn 




arg 


Met (M) 


leu; phe; ile 


leu 




Phe (F) 


leu; val; ile: ala; tyr 




leu 


Pro (P) 


ala 




ala 


Scr(S) 


thr 




thr 


Thr (T) 


ser 




ser 


Tip (W) 


tyr; phe 




tyr 


Tyr(Y) 


trp; phe; thr; ser 






Val (V) 


ile; leu; met; phe; 







ala; norleueine leu 



Substantial modifications in function or immunological identity of the SRT polypeptide 

arc accomplished by selecting substitutions that differ significantly in their effect on maintaining 
15 (a) the structure of the polypeptide backbone in the area of the substitution, for example, as a 

sheet or helical conformation, (b) the charge or hydrophobicity of the molecule at the target site. 

or (c) the bulk of the side chain. Naturally occurring residues are divided into groups based on 

common side-chain properties; 

(l ) hydrophobic: norleueine, met, ala. val, leu, ile; 
20 (2) neutral hydrophilic: eys. ser. thr; 

(3) acidic: asp, glu; 

(4) basic: asn, gin, his, lys, arg; 

(5) residues that influence chain orientation: gly, pro; and 

(6) aromatic: trp, tyr, phe. 

25 Non-conservative substitutions will entail exchanging a member of one of these classes 

for another class. Such substituted residues also may be introduced into the conservative 
substitution sites or, more preferably, into the remaining (non-conserved) sites. 

The variations can be made using methods known in the art such as oligonucleotide- 
mcdiated (site-directed) mutagenesis, alanine scanning, and PCR mutagenesis. Site-directed 

30 mutagenesis [Carter ct al., Nucl. Acids Res. , 13:4331 (1986); Zollcr et al., Nucl. Acids Res. , 
10:6487 (1987)], cassette mutagenesis [Wells ct al.. Gene . 34:315 (1985)], restriction selection 
mutagenesis [Wells et al., Philos. Trans. R. Soc. London ScrA , 317 :415 (1986)] or other known 
techniques can be performed on the cloned DNA to produce the SRT variant DNA. 

Scanning amino acid analysis can also be employed to identify one or more amino acids 

35 along a contiguous sequence. Among the preferred scanning amino acids are relatively small. 
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neutral amino acids. Such amino acids include alanine, glycine, serine, and cysteine. Alanine 
is typically a preferred scanning amino acid among this group because it eliminates the side-chain 
beyond the beta-carbon and is less likely to alter the main-chain conformation of the variant 
[Cunningham and Wells, Science . 244 : 1081 -1085 (1989)]. Alanine is also typically preferred 
because it is the most common amino acid. Further, it is frequently found in both buried and 
5 exposed positions ICreighton, The Proteins . (W.H. Freeman & Co., N.Y.); Chothia, J. Mol. Biol. , 
1 50 ; 1 (1976)]. If alanine substitution does not yield adequate amounts of variant, an isoteric 
amino acid can be used. 

C. Modifications of SRT Polypeptides 

10 Covalent modifications of SRT polypeptides are included within the scope of this 

invention. One type of covalent modification includes reacting targeted amino acid residues of 
a SRT polypeptide with an organic derivatizing agent that is capable of reacting with selected 
side chains or the N- or C- terminal residues of the SRT. Derivatization with Afunctional agents 
is useful, for instance, for crossl inking SRT to a water-insoluble support matrix or surface for use 

15 in the method for purifying anti-SRT antibodies, and vice-versa. Commonly used crosslinking 
agents include, e.g., 1 , 1 -bis(diazoacelyl)-2-phenylcthane, glutaraldehyde, N-hydroxysuccinimide 
esters, for example, esters with 4-azidosalicylic acid, homobi functional imidocsters, including 
disuccinimidyl esters such as 3,3-dithiobis(succinimidylpropionate), Afunctional maleimides 
such as bis-N-malcimido- 1 ,8-octane and agents such as methyl-3-[(p- 

20 azidophenyl)dithiolpropioimidate. 

Other modifications include deamidation of glutaminyl and asparaginyl residues to the 
corresponding glutamyl and aspartyl residues, respectively, hydroxylation of proline and lysine, 
phosphorylation of hydroxy! groups of seryl or threonyl residues, methylation of the a-amino 
groups of lysine, arginine, and histidinc side chains [T.E. Creighton, Proteins: Structure and 

25 Molecular Properties , W.H. Freeman & Co., San Francisco, pp. 79-86 ( 1 983)], acetylation of the 
N-terminal amine, and amidation of any C-terminal carboxyl group. 

Another type of covalent modification of the SRT polypeptide included within the scope 
of this invention comprises altering the native glycosylation pattern of the polypeptide. "Altering 
the native glycosylation pattern" is intended for purposes herein to mean deleting one or more 

30 carbohydrate moieties found in native sequence SRT (either by removing the underlying 
glycosylation site or by deleting the glycosylation by chemical and/or enzymatic means), and/or 
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adding one or more glycosylation sites that are not present in the native sequence SRT. In 
addition, the phrase includes qualitative changes in the glycosylation of the native proteins, 
involving a change in the nature and proportions of the various carbohydrate moieties present. 

Addition of glycosylation sites to the SRT polypeptide may he accomplished by altering 
the amino acid sequence. The alteration may be made, for example, by the addition of, or 
5 substitution by, one or more serine or threonine residues to the native sequence SRT (for O- 
1 inked glycosylation sites). The SRT amino acid sequence may optionally be altered through 
changes at the DNA level, particularly by mutating the DNA encoding the SRT polypeptide at 
preselected bases such that codons are generated that will translate into the desired amino acids. 

Another means of increasing the number of carbohydrate moieties on the SRT 
10 polypeptide is by chemical or enzymatic coupling of glycosides to the polypeptide. Such 
methods are described in the art, e.g., in WO 87/05330 published 1 1 September 1987, and in 
Aplin and Wriston, CRC Crit. Rev. Biochem. , pp. 259-306 ( 198 1 ). 

Removal of carbohydrate moieties present on the SRT polypeptide may be accomplished 
chemically or enzymatically or by mutational substitution of codons encoding for amino acid 
1 5 residues that serve as targets for glycosylation. Chemical dcglycosylation techniques are known 
in the art and described, for instance, by Hakimuddin et al., Arch. Biochem. Biophys. , 259:52 
( 1 987) and by Edge et al.. Anal. Biochem. 1 18:131 ( 1 98 1 ). Enzymatic cleavage of carbohydrate 
moieties on polypeptides can be achieved by the use of a variety of endo- and exo-glycosidases 
as described by Thotakura et al., Meth. Enzymol. . 138:350 (1987). 
20 Another type of covalent modification of SRT comprises linking the SRT polypeptide to 

one of a variety of nonproteinaceous polymers, e.g., polyethylene glycol (PEG), polypropylene 
glycol, or polyoxyalkylenes, in the manner set forth in U.S. Patent Nos. 4,640,835; 4,496,689; 
4,30 1 , J 44; 4,670,4 1 7; 4,79 1 . 1 92 or 4, 1 79.337. 

The SRT polypeptides of the present invention may also be modified in a way to form 
25 chimeric molecules comprising SRT fused to another, heterologous polypeptide or amino acid 
sequence. 

In one embodiment, such a chimeric molecule comprises a fusion of the SRT with a tag 
polypeptide which provides an epitope to which an anti-tag antibody can selectively bind. The 
epitope tag is generally placed at the amino- orcarboxyl- terminus of the SRT. The presence of 
30 such epitopc-tagged forms of the SRT can be detected using an antibody against the tag 
polypeptide. Also, provision of the epitope tag enables the SRT to be readily purified by affinity 
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purification using an anti-tag antibody or another type of affinity matrix that binds to the epitope 
tag. Various tag polypeptides and their respective antibodies are well known in the art. 
Examples include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; the flu 
HA tag polypeptide and its antibody 1 2CA5 [ Field et al., Mol.Cell. Biol. . 8:21 59-2 1 65 ( 1 988)]; 
the e-myc tag and the 8F9, 3C7, 6E10, G4, B7 and 9E10 antibodies thereto [Evan et al., 
5 Molecular and Cellular Biolo gy, 5:3610-3616 (1985)]; and the Herpes Simplex virus 
glycoprotein D (gD) tag and its antibody [Paborsky et al.. Protein Engineering , 3(6):547-553 
(1990)]. Other tag polypeptides include the Flag-peptide [Hopp et al., BioTechnology , 6:1204- 
12 10 (1988)]; the KT3 epitope peptide I Martin et al.. Science , 255 : 1 92- 1 94 ( 1 992)1 ; an q-tubulin 
epitope peptide [Skinner et al.. J. Biol. Chem. . 266: 1 5 1 63- 1 5 1 66 ( 1 99 1 )]; and the T7 gene 10 

1 0 protein peptide tag [Lutz-Freyermuth et al., Proc. Natl. Acad. Sci. USA . 87:6393-6397 (1990)]. 

In an alternative embodiment, the chimeric molecule may comprise a fusion of the SRT 
with an immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of the 
chimeric molecule (also referred to as an kv immunoadhesin ,, ), such a fusion could be to the Fc 
region of an IgG molecule. The Ig fusions preferably include the substitution of a soluble 

15 (transmembrane domain deleted or inactivated) form of a SRT polypeptide in place of at least 
one variable region within an Ig molecule. In a particularly preferred embodiment, the 
immunoglobulin fusion includes the hinge, CH2 and CH3, or the hinge, CHI, CH2 and CH3 
regions of an IgG I molecule. For the production of immunoglobulin fusions see also US Patent 
No. 5,428,130 issued June 27, 1995. 

20 

D. Preparation of SRT Polypeptides 
The description below relates primarily to production of SRT by culturing cells 
transformed or transfected with a vector containing SRT nucleic acid. It is, of course, 
contemplated that alternative methods, which are well known in the art, may be employed to 

25 prepare SRT. For instance, the SRT sequence, or portions thereof, may be produced by direct 
peptide synthesis using solid-phase techniques [see, e.g., Stewart et al., Solid-Phase Peptide 
Synthesis , W.H. Freeman Co., San Francisco, CA (1969); Merrifield, J. Am. Chem. Soc. , 
85:2149-2 154 (1963)]. /// vitro protein synthesis may be performed using manual techniques or 
by automation. Automated synthesis may be accomplished, for instance, using an Applied 

30 Biosystcms Peptide Synthesizer (Foster City, CA) using manufacturer's instructions. Various 
portions of the SRT may be chemically synthesized separately and combined using chemical or 
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enzymatic methods to produce the full -length SRT. 

1 . Isolation of DNA Encoding SRT 
DN A encoding SRT may be obtained from a cDNA library prepared from tissue believed 
to possess the SRT mRNA and to express it at a detectable level. Accordingly, human SRT 
5 DNA can be conveniently obtained from a cDNA library prepared from human tissue, such as 
described in the Examples. The SRT-cncoding gene may also be obtained from a genomic 
library or by known synthetic procedures (e.g., automated nucleic acid synthesis). 

Libraries can be screened with probes (such as antibodies to the SRT or oligonucleotides 
of at least about 20-80 bases) designed to identify the gene of interest or the protein encoded by 
10 it. wherein those probes may be based upon the polynucleotide sequences shown in the 
accompanying figures. Screening the cDNA or genomic library with the selected probe may be 
conducted using standard procedures, such as described in Sambrook et ah, Molecular Cloning: 
A Laboratory Manual (New York: Cold Spring Harbor Laboratory Press, 1989). An alternative 
means to isolate the gene encoding SRT is to use PCR methodology [Sambrook et al., supra ; 
1 5 Dieffenbach et al., PCR Primer: A Laboratory Manual (Cold Spring Harbor Laboratory Press, 
1995)]. 

The Examples below describe techniques for screening a cDNA library. The 
oligonucleotide sequences selected as probes should be of sufficient length and sufficiently 
unambiguous that false positives are minimized. The oligonucleotide is preferably labeled such 

20 that it can be detected upon hybridization to DNA in the library being screened. Methods of 
labeling are well known in the art, and include the use of radiolabels like 2 P-labeled ATP, 
biotinylation or enzyme labeling. Hybridization conditions, including moderate stringency and 
high stringency, are provided in Sambrook el al., supra . 

Sequences identified in such library screening methods can be compared and aligned to 

25 other known sequences deposited and available in public databases such as GenBank or other 
private sequence databases. Sequence identity (at either the amino acid or nucleotide level) 
within defined regions of the molecule or across the full-length sequence can be determined using 
methods known in the art and as described herein. 

Nucleic acid having protein coding sequence may be obtained by screening selected 

30 cDNA or genomic libraries using the deduced amino acid sequence disclosed herein for the first 
time, and, if necessary, using conventional primer extension procedures as described in 
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Sumbrook et al., supra , to detect precursors and processing intermediates of mRNA thai may not 
have been reverse-transcribed into cDNA. 

2. Selection and Transformation of Host Cells 
Host cells are transfeetcd or transformed with expression or cloning vectors described 
5 herein for SRT production and cultured in conventional nutrient media modified as appropriate 
for inducing promoters, selecting trans formants, or amplifying the genes encoding the desired 
sequences. The culture conditions, such as media, temperature, pH and the like, can be selected 
by the skilled artisan without undue experimentation. In general, principles, protocols, and 
practical techniques for maximizing the productivity of cell cultures can be found in Mammalian 
10 Cell Biotechnology: a Practical Approach , M. Butler, ed. (IRL Press, 1 99 1 ) and Sambrook ct al., 
supra . 

Methods of cukaryotic cell transfection and prokaryotic cell transformation are known 
to the ordinarily skilled artisan, for example, CaCl 2 . CaP0 4 , liposomc-mcdiatcd and 
electroporation. Depending on the host cell used, transformation is performed using standard 

15 techniques appropriate to such cells. The calcium treatment employing calcium chloride, as 
described in Sambrook et al.. supra , or electroporation is generally used for prokaryotcs. 
Infection with Agrobacterium tumefaciens is used for transformation of certain plant cells, as 
described by Shaw et al., Gene , 23:3 15 (1 983) and WO 89/05859 published 29 June 1 989. For 
mammalian cells without such cell walls, the calcium phosphate precipitation method of Graham 

20 and van der Eb, Virology , 52:456-457 ( 1 978) can be employed. General aspects of mammalian 
cell host system transfections have been described in U.S. Patent No. 4,399,216. 
Transformations into yeast are typically carried out according to the method of Van Solingen et 
al., J. Bact. . 130:946 (1977) and Hsiao et al., Proc. Natl. Acad. Sci. (USA) , 76:3829 (1979). 
However, other methods for introducing DNA into cells, such as by nuclear microinjection, 

25 electroporation, bacterial protoplast fusion with intact cells, or polycations, e.g., polybrene, 
polyornithine, may also be used. For various techniques for transforming mammalian cells, sec 
Keown et al.. Methods in Enzymology . 1 85:527-537 ( 1 990) and Mansouret al.. Nature . 336:348- 
352 (1988). 

Suitable host cells for cloning or expressing the DNA in the vectors herein include 
30 prokaryote, yeast, or higher eukaryote cells. Suitable prokaryotes include but are not limited to 
eubacteria, such as Gram-negative or Gram-positive organisms, for example, Enterobacteriaceae 
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such as coli, Various E. coli strains arc publicly available, such as /:. coli K 1 2 strain MM294 
(ATCC 31,446); /:. coli XI776 (ATCC 31.537); /:. coli strain W31 10 (ATCC 27.325) and K5 
772 (ATCC 53,635). Other suitable prokaryotic host cells include Hntcrobaeteriaeeae such as 
Escherichia, e.g., /; coli, Enterobacter, Erwinia, Klebsiella, Proteus, Salmonella, e.g.. 
Salmonella typhimurium, Serratia, e.g., Serratia marcescans, and Shigella, as well as Bacilli 
5 such as /i. subtilis and tf. lichenijormis (e.g., lichenijormis 41 P disclosed in DD 266,710 
published 12 April 1989), Pseudomonas such as f\ aeruginosa, and Streptomyces. These 
examples arc illustrative rather than limiting. Strain W3 1 10 is one particularly preferred host or 
parent host because it is a common host strain for recombinant DNA product fermentations. 
Preferably, the host cell secretes minimal amounts of proteolytic enzymes. For example, strain 

1 0 W3 1 10 may be modified to effect a genetic mutation in the genes encoding proteins endogenous 
to the host, with examples of such hosts including E. coli W31 10 strain 1A2, which has the 
complete genotype tonA ; E. coli W3 I 1 0 strain 9E4, which has the complete genotype tonA ptr3\ 
E. coli W31 10 strain 27C7 (ATCC 55,244), which has the complete genotype tonA ptrJ phoA 
El 5 (argE-lac)J69 degP ompT kan'\ E. coli W31 10 strain 37D6, which has the complete 

1 5 genotype tonA ptr3 phoA El 5 (argE-lac)I69 degP ompT rbs7 ilvG kan': E. coli W3 1 10 strain 
40B4, which is strain 37D6 with a non-kanamycin resistant degP deletion mutation; and an E. 
coli strain having mutant periplasmic protease disclosed in U.S. Patent No. 4.946,783 issued 7 
August 1990. Alternatively, in vitro methods of cloning, e.g., PCR or other nucleic acid 
polymerase reactions, are suitable. 

20 In addition to prokaryotes, eukaryotic microbes such as filamentous fungi or yeast are 

suitable cloning or expression hosts for SRT-encoding vectors. Saccharomyces cerevisiae is a 
commonly used lower eukaryotic host microorganism. Others include Schizosaccharomyces 
pombe (Beach and Nurse, Nature , 290; 140 11981]; EP 139,383 published 2 May 1985); 
Kluyveromyces hosts (U.S. Patent No. 4,943,529; Fleer et al., Bio/Technology . 9:968-975 ( 1 99 1 )) 

25 such as, e.g., K. lactis (MW98-8C, CBS683, CBS4574; Louvencourt et al., J. Bacteriol. , 737 
[ 1983|), K. frag His (ATCC 12,424), K. bidgaricus (ATCC 16,045), K. wickeramii (ATCC 
24,178), A", waltii (ATCC 56,500), K. drosophiiarum (ATCC 36,906; Van den Berg et al., 
Bio/Technology . 8; 1 35 ( 1 990)), K. thermotolerans, and K. marxianus; yarrow ia (EP 402,226); 
Piclua pastoris (EP 183,070; Sreekrishna et al., J. Basic Microbiol. . 28:265-278 [1988]); 

30 Candida; Trichoderma reesia (EP 244,234); Neurospora crassa (Case et al., Proc. Natl. Acad. 
Sci. USA , 76:5259-5263 [ 1979]); Schwanniomyces such as Schwanniomyces occidentalis (EP 
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394.538 published 31 October 1990): and filamentous fungi such as. e.g.. Neurospora, 
Pcnici/Iium, Tolypocladium (WO 9 1/00357 published 10 January 1991 ). and Aspergillus hosts 
such as A. niththms (Ballance et al., Biochem. Biophys. Res. Commun. , 1 12:284-289 [ 19831: 
Tilburn et al .. Gene, 26: 205-22 1 [ 1 983 ] : Yelton et al.. Proc. Natl. Acad. Sci. USA , 81:1 470- 1 474 
[1984])and/\. niger (Kelly and Hvnes. EMBO J. , 4:475-479 [1985]). Methylotropic yeasts are 
5 suitable herein and include, but are not limited to, yeast capable of growth on methanol selected 
from the genera consisting of Hansenula, Candida, Kloeckcra, Pichia, Saccharomyces, 
Tondopsis, and Rhodotorida. A list of specific species that are exemplary of this class of yeasts 
may be found in C. Anthony, The Biochemistry of Methylotrophs , 269 (1982). 

Suitable host cells for the expression of glycosylated SRT are derived from multicellular 

10 organisms. Examples of invertebrate cells include insect cells such as Drosophila S2 and 
Spodoptera Sf9, as well as plant cells. Examples of useful mammalian host cell lines include 
Chinese hamster ovary (CHO) and COS cells. More specific examples include monkey kidney 
CV l line transformed by SV40 (COS-7, ATCC CRL 165 1 ): human embryonic kidney line (293 
or 293 cells subcloned for growth in suspension culture, Graham et al.. J. Gen Virol. , 36:59 

15 ( 1 977)): Chinese hamster ovary cells/-DHFR (CHO, Urlaub and Chasin, Proc. Natl. Acad. Sci. 
USA . 77:4216 (1980)): mouse Sertoli cells (TM4, Mather, Biol. Reprod. . 23:243-251 (1980)); 
human lung cells (W 1 38, ATCC CCL 75); human liver cells (Hep G2. HB 8065); and mouse 
mammary tumor (MMT 060562, ATCC CCL51). The selection of the appropriate host cell is 
deemed to be within the skill in the art. 

20 

3. Selection and Use of a Repiicable Vector 
The nucleic acid (e.g., cDNA or genomic DNA) encoding SRT may be inserted into a 
repiicable vector for cloning (amplification of the DNA) or for expression. Various vectors are 
publicly available. The vector may, for example, be in the form of a plasmid, cosmid, viral 

25 particle, or phage. The appropriate nucleic acid sequence may be inserted into the vector by a 
variety of procedures. In general, DNA is inserted into an appropriate restriction endonuclease 
sitc(s) using techniques known in the art. Vector components generally include, but are not 
limited to, one or more of a signal sequence, an origin of replication, one or more marker genes, 
an enhancer element, a promoter, and a transcription termination sequence. Construction of 

30 suitable vectors containing one or more of these components employs standard ligation 
techniques which are known to the skilled artisan. 
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The SRT may be produced recombinantly not only directly, but also as a fusion 
polypeptide with a heterologous polypeptide, which may be a signal sequence or other 
polypeptide having a specific cleavage site at the N-terminus of the mature protein or 
polypeptide. In general, the signal sequence may be a component of the vector, or it may be a 
part of the SRT-encoding DNA that is inserted into the vector. The signal sequence may be a 
5 prokaryotic signal sequence selected, lor example, from the group ol the alkaline phosphatase, 
penicillinase, lpp, or heat-stable cnterotoxin II leaders. For yeast secretion the signal sequence 
may be, e.g., the yeast invertase leader, alpha factor leader (including Saccharomyces and 
Kluyveromyces a-factor leaders, the latter described in U.S. Patent No. 5,010,182), or acid 
phosphatase leader, the C. albicans glucoamylasc leader (EP 362,179 published 4 April 1990), 
10 or the signal described in WO 90/13646 published 15 November 1990. In mammalian cell 
expression, mammalian signal sequences may be used to direct secretion of the protein, such as 
signal sequences from secreted polypeptides of the same or related species, as well as viral 
secretory leaders. 

Both expression and cloning vectors contain a nucleic acid sequence that enables the 
15 vector to replicate in one or more selected host cells. Such sequences are well known for a 
variety of bacteria, yeast, and viruses. The origin of replication from the plasmid pBR322 is 
suitable for most Gram-negative bacteria, the 2u plasmid origin is suitable for yeast, and various 
viral origins (SV40, polyoma, adenovirus, VSV or BPV) are useful for cloning vectors in 
mammalian cells. 

20 Expression and cloning vectors will typically contain a selection gene, also termed a 

selectable marker. Typical selection genes encode proteins that (a) confer resistance to 
antibiotics or other toxins, e.g., ampicillin, neomycin, methotrexate, or tetracycline, (b) 
complement auxotrophic deficiencies, or (c) supply critical nutrients not available from complex 
media, e.g., the gene encoding D-alanine racemase for Bacilli. 

25 An example of suitable selectable markers for mammalian cells are those that enable the 

identification of cells competent to take up the SRT-encoding nucleic acid, such as DHFR or 
thymidine kinase. An appropriate host cell when wild-type DHFR is employed is the CHOcell 
line deficient in DHFR activity, prepared and propagated as described by Urlaub ct al., Proc. 
Natl. Acad. Sci. USA , 77:4216(1980). A suitable selection gene for use in yeast is the trp\ gene 

30 present in the yeast plasmid YRp7 [Stincheomb et al., Nature , 282:39 (1979); Kingsman et al., 
Gene , 7: 141 ( 1 979): Tschemperet al.. Gene , 10:157(1980)]. Thc/r/H gene provides a selection 
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marker lor a mutant strain of yeast lacking the ability to grow in tryptophan, for example, ATCC 
No. 44076 or PEP4-1 [Jones, Genetics . 85: 12 (1977)]. 

Expression and cloning vectors usually contain a promoter operably linked to the SRT- 
encoding nucleic acid sequence to direct mRNA synthesis. Promoters recognized by a variety 
of potential host cells are well known. Promoters suitable for use with prokaryotic hosts include 
5 the (^lactamase and lactose promoter systems [Chang et ah, Nature . 275:615 (1978); Goeddel 
et al., Nature . 281:544 (1979)], alkaline phosphatase, a tryptophan (trp) promoter system 
[Goeddel, Nucleic Acids Res. , 8:4057 ( 1 980): EP 36,776], and hybrid promoters such as the tac 
promoter (dcBocr et al., Proc. Natl. Acad. Sci. USA . 80:21-25 (1983)1. Promoters for use in 
bacterial systems also will contain a Shine-Dalgarno (S.D.) sequence operably linked to the DN A 
10 encoding SRT. 

Examples of suitable promoting sequences lor use with yeast hosts include the promoters 
for 3-phosphoglycerate kinase [Hitzcman et al.. J. Biol. Chem. , 255:2073 (1980)1 or other 
glycolytic enzymes [Hess et al., J. Adv. Enzyme Reg ., 7:149 (1968); Holland, Biochemistry , 
17:4900 (1978)], such as enolase, glyeeraldch\de-3-phosphate dehydrogenase, hexokinase, 
15 pyruvate decarboxylase, phosphofructokinase, glucose-6-phosphate isomerase, 3- 
phosphoglycerate mutase, pyruvate kinase, triosephosphate isomerase, phosphoglucose 
isomerase. and glucokinase. 

Other yeast promoters, which are inducible promoters having the additional advantage 
of transcription controlled by growth conditions, are the promoter regions for alcohol 
20 dehydrogenase 2, isocytochrome C, acid phosphatase, degradative enzymes associated with 
nitrogen metabolism, metallothionein, glyceraIdehyde-3-phosphate dehydrogenase, and enzymes 
responsible for maltose and galactose utilization. Suitable vectors and promoters for use in yeast 
expression are further described in EP 73,657. 

SRT transcription from vectors in mammalian host cells is controlled, for example, by 
25 promoters obtained from the genomes of viruses such as polyoma virus, fowl pox virus (UK 
2,21 1,504 published 5 July 1989), adenovirus (such as Adenovirus 2), bovine papilloma virus, 
avian sarcoma virus, cytomegalovirus, a retrovirus, hepatitis-B virus and Simian Virus 40 
(S V40), from heterologous mammalian promoters, e.g., the actin promoter or an immunoglobulin 
promoter, and from heat -shock promoters, provided such promoters are compatible with the host 
30 cell systems. 

Transcription of a DNA encoding the SRT by higher eukaryotes may be increased by 
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inserting an enhancer sequence into the vector. Enhancers are cis-acting elements of DNA. 
usually about from 10 to 300 bp, that act on a promoter to increase its transcription. Many 
enhancer sequences are now known from mammalian genes (globin. elastase, albumin, a- 
fetoprotein, and insulin). Typically, however, one will use an enhancer from a eukaryotic cell 
virus. Examples include the SV40 enhancer on the late side of the replication origin (bp 100- 
5 270). the cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side of the 
replication origin, and adenovirus enhancers. The enhancer may be spliced into the vector at a 
position 5' or 3* to the SRT coding sequence, but is preferably located at a site 5' from the 
promoter. 

Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, animal, 
10 human, or nucleated cells from other multicellular organisms) will also contain sequences 
necessary for the termination of transcription and for stabilizing the mRNA. Such sequences are 
commonly available from the 5' and, occasionally 3\ untranslated regions of eukaryotic or viral 
DNAs or cDNAs. These regions contain nucleotide segments transcribed as polyadenylated 
fragments in the untranslated portion of the mRNA encoding SRT. 
1 5 Still other methods, vectors, and host cells suitable for adaptation to the synthesis of SRT 

in recombinant vertebrate cell culture are described in Gething et al.. Nature, 293:620-625 
(1 98 I ); Mantei et al.. Nature . 28 1 :40-46 ( 1 979); EP 1 17.060; and EP 1 17,058. 

4. Detecting Gene Amplification/Expression 
20 Gene amplification and/or expression may be measured in a sample directly, for example, 

by conventional Southern blotting. Northern blotting to quantitate the transcription of mRNA 
LThomas, Proc. Natl. Acad. Sei. USA . 77:5201-5205 (1980)], dot blotting (DNA analysis), or in 
situ hybridization, using an appropriately labeled probe, based on the sequences provided herein. 
Alternatively, antibodies may be employed that can recognize specific duplexes, including DNA 
25 duplexes, RNA duplexes, and DNA-RNA hybrid duplexes or DNA-protein duplexes. The 
antibodies in turn may be labeled and the assay may be carried out where the duplex is bound to 
a surface, so that upon the formation of duplex on the surface, the presence of antibody bound 
to the duplex can be detected. 

Gene expression, alternatively, may be measured by immunological methods, such as 
30 immunohistochemical staining of cells or tissue sections and assay of cell culture or body fluids, 
to quantitate directly the expression of gene product. Antibodies useful for 
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imniunohistochcmical staining and/or assay of sample fluids may be either monoclonal or 
polyclonal, and may be prepared in any mammal. Conveniently, the antibodies may be prepared 
against a native sequence SRT polypeptide or against a synthetic peptide based on the DNA 
sequences provided herein or against exogenous sequence fused to SRT DNA and encoding a 
specific antibody epitope. 

5 

5. Purification of Polypeptide 
Forms of SRT may be recovered from eulture medium or from host cell lysates. If 
membrane-bound, it can be released from the membrane using a suitable detergent solution (e.g. 
Triton-X 100) or by enzymatic cleavage. Cells employed in expression of SRT can be disrupted 

10 by various physical or chemical means, such as freeze-thaw cycling, sonication, mechanical 
disruption, or cell lysing agents. 

It may be desired to purify SRT from recombinant cell proteins or polypeptides. The 
following procedures are exemplary of suitable purification procedures: by fractionation on an 
ion-exchange column; ethanol precipitation; reverse phase HPLC; chromatography on silica or 

1 5 on a cation-exchange resin such as DEAE; ehromatofocusing; SDS-PAGE; ammonium sulfate 
precipitation; gel filtration using, for example, Sephadex G-75; protein A Sepharose columns to 
remove contaminants such as IgG; and metal chelating columns to bind epitope-tagged forms of 
the SRT. Various methods of protein purification may be employed and such methods are known 
in the art and described for example in Deutscher, Methods in Enzymologv , 1 82 ( 1 990); Scopes, 

20 Protein Purification: Principles and Practice , Springer-Verlag, New York (1982). The 
purification step(s) selected will depend, for example, on the nature of the production process 
used and the particular SRT produced. 

E. Uses for SRT Polynucleotides and Polypeptides 
25 SRT nucleotide sequences (and/or their complements) disclosed herein have various 

applications in the art of molecular biology, including for example uses as hybridization probes, 
in chromosome and gene mapping, in tissue typing, disease tissue detection, in PGR 
technologies, in screening for new therapeutic molecules and in the generation of anti-sense RNA 
and DNA. SRT nucleic acid will also be useful for the preparation of SRT polypeptides by the 
30 recombinant techniques described herein. 

The SRT polynucleotides disclosed herein, or portions thereof, may be used as 
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hybridization probes for a cDNA library to isolate the full-length SRT eDNA or to isolate still 
other cDNAs (for instance, those encoding naturally-occurring variants of SRT or SRT from 
other species) which have a desired sequence identity to the SRT sequence of interest. 
Optionally, the length of the probes will be about 20 to about 50 bases. The hybridization probes 
may be derived from at least partially novel regions of the nucleotide sequences disclosed herein 
5 wherein those regions may be determined without undue experimentation or from genomic 
sequences including promoters, enhancer elements and introns of native sequence SRT. By way 
of example, a screening method will comprise isolating the coding region of the SRT gene using 
the known DNA sequence to synthesize a selected probe of about 40 bases. Hybridization probes 
may be labeled by a variety of labels, including radionucleotides such as ,J P or VS S, or enzymatic 

10 labels such as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems. 
Labeled probes having a sequence complementary to that of the SRT gene of the present 
invention can be used to screen libraries of human cDNA, genomic DNA or mRNA to determine 
which members of such libraries the probe hybridizes to. Hybridization techniques are described 
in further detail in the Examples below. 

15 PCR as described in U.S. Pat. Nos. 4,683,195; 4.800.195; and 4.965,188 provides 

additional uses for oligonucleotides based upon the polynucleotide sequences disclosed in the 
accompanying figures. Such oligomers are generally chemically synthesized, but they may be 
of recombinant origin or a mixture of both. Oligomers generally comprise two nucleotide 
sequences, one with sense orientation (5' to 3') and one with antisense (3' to 5") employed under 

20 optimized conditions for identification of a specific gene or diagnostic use. The same two 
oligomers, nested sets of oligomers, or even a degenerate pool of oligomers may be employed 
under less stringent conditions for identification and/or quantitation of closely related DNA or 
RNA sequences. 

Full length genes may be cloned utilizing partial nucleotide sequence and various 
25 methods known in the art. Gobinda et al. PCR Methods Applic. 2:318-322 (1993) disclose 
"restriction-site PCR" as a direct method which uses universal primers to retrieve unknown 
sequence adjacent to a known locus. First, genomic DNA is amplified in the presence of primer 
to linker and a primer specific to the known region. The amplified sequences are subjected to a 
second round of PCR with the same linker primer and another specific primer internal to the first 
30 one. Products of each round of PCR are transcribed with an appropriate RNA polymerase and 
sequenced using reverse transcriptase. Gobinda et al present data concerning Factor IX for which 
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ihcy identified a conserved stretch of 20 nucleotides in the X noncoding region of the gene. 

Inverse PCR is the first method to report successful acquisition of unknown sequences 
starting with primers based on a known region (Trigliaet al.. Nucleic Acids Res. 1 6:8 1 86 ( 1988). 
The method uses several restriction enzymes to generate a suitable fragment in the known region 
of a gene. The fragment is then circularized by intramolecular ligation and used as a PCR 
5 template. Divergent primers are designed from the known region. The multiple rounds of 
restriction enzyme digestions and ligations that are necessary prior to PCR make the procedure 
slow and expensive (Gobinda et al, supra). 

Capture PCR (Lagerstrom et al.. PCR Methods Applic. 1 : 1 1 1-1 19 ( 1991 ) is a method for 
PCR amplification of DNA fragments adjacent to a known sequence in human and YAC DNA. 

10 As noted by Gobinda et al. (supra), capture PCR also requires multiple restriction enzyme 
digestions and ligations to place an engineered double-stranded sequence into an unknown 
portion of the DNA molecule before PCR. Although the restriction and ligation reactions are 
carried out simultaneously, the requirements for extension, immobilization and two rounds of 
PCR and purification prior to sequencing render the method cumbersome and time consuming. 

1 5 Parker et al.. Nucleic Acids Res. 19:3055-3060 ( 1991 ) teach walking PCR, a method for 

targeted gene walking which permits retrieval of unknown sequence. PromoterFinder™ is a new 
kit available from Clontech (Palo Alto, Calif.) which uses PCR and primers derived from p53 
to walk in genomic DNA. Nested primers and special PromoterFinder libraries are used to detect 
upstream sequences such as promoters and regulatory elements. This process avoids the need to 

20 screen libraries and is useful in finding intron/exon junctions. 

Another new PCR method, "Improved Method for Obtaining Full Length cDNA 
Sequences" (see U.S. Patent No. 5,817,479, issued October 6, 1998), employs XL-PCR 
(Perkin-Elmer, Foster City. Calif.) to amplify and extend partial nucleotide sequence into longer 
pieces of DNA. This method was developed to allow a single researcher to process multiple 

25 genes (up to 20 or more) at one time and to obtain an extended (possibly full-length) sequence 
within 6-10 days. This new method replaces methods which use labelled probes to screen 
plasmid libraries and allow one researcher to process only about 3-5 genes in 14-40 days. 

In the first step, which can be performed in about two days, any two of a plurality of 
primers are designed and synthesized based on a known partial sequence. In step 2, which takes 

30 about six to eight hours, the sequence is extended by PCR amplification of a selected library. 
Steps 3 and 4, which take about one day. are purification of the amplified cDNA and its ligation 
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into an appropriate vector. Step 5, which takes about one day, involves transforming and growing 
up host bacteria. In step 6, which takes approximately five hours, PCR is used to screen bacterial 
clones for extended sequence. The final steps, which take about one day. involve the preparation 
and sequencing of selected clones. 

If the full length cDNA has not been obtained, the entire procedure is repeated using 
5 either the original library or some other preferred library. The preferred library may be one that 
has been size-selected to include only larger cDNAs or may consist of single or combined 
commercially available libraries, eg. lung, liver, heart and brain from Gibeo/BRL (Gaithersburg, 
Md.). The cDNA library may have been prepared with oligo (dT) or random priming. Random 
primed libraries are preferred in that they will contain more sequences which contain 5' ends of 

10 genes. A randomly primed library may be particularly useful if an oligo (dT) library does not 
yield a complete gene. 

The nucleotide sequence for any particular polynucleotide shown in the accompanying 
figures can also be used to generate probes for mapping the native genomic sequence. The 
sequence may be mapped to a particular chromosome or to a specific region of the chromosome 

15 using well known techniques. These include in situ hybridization to chromosomal spreads 
(Verma et al., "Human Chromosomes: A Manual of Basic Techniques**, Pergamon Press, New 
York City, 1 988), flow-sorted chromosomal preparations, or artificial chromosome constructions 
such as yeast artificial chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial 
PI constructions or single chromosome cDNA libraries. 

20 In situ hybridization of chromosomal preparations and physical mapping techniques such 

as linkage analysis using established chromosomal markers are invaluable in extending genetic 
maps. Examples of genetic maps can be found in the 1994 Genome Issue of Science (265: 198 1 Q. 
Often the placement of a gene on the chromosome of another mammalian species may reveal 
associated markers even if the number or arm of a particular human chromosome is not known. 

25 New partial nucleotide sequences can be assigned to chromosomal arms, or parts thereof, by 
physical mapping. This provides valuable information to investigators searching for disease 
genes using positional cloning or other gene discovery techniques. Once a disease or syndrome, 
such as ataxia telangiectasia (AT), has been crudely localized by genetic linkage to a particular 
genomic region, for example, AT to 1 lq22-23 (Gatti et al., Nature 336:577-580 (1988), any 

30 sequences mapping to that area may represent genes for further investigation. The nucleotide 
sequences ol the subject invention may also be used to detect differences in the chromosomal 
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location of nucleotide sequences due to translocation, inversion, etc., between norma) and carrier 
or affected individuals. 

The partial nucleotide sequence encoding a particular SRT polypeptide may be used to 
produce an amino acid sequence using well known methods of recombinant DNA technology. 
The amino acid or peptide may be expressed in a variety of host cells, either prokaryotic or 
5 eukaryotic. Host cells may be from the same species from which the nucleotide sequence was 
derived or from a different species. Advantages of producing an amino acid sequence or peptide 
by recombinant DNA technology include obtaining adequate amounts for 
purification and the availability of simplified purification procedures. 

Cells transformed with an SRT nucleotide sequence may be cultured under conditions 

10 suitable for the expression and recovery of peptide from cell culture as described above. The 
peptide produced by a recombinant cell may be secreted or may be contained intracellularly 
depending on the sequence itself and/or the vector used. In general, it is more convenient to 
prepare recombinant proteins in secreted form, and this is accomplished by ligating SRT to a 
recombinant nucleotide sequence which directs its movement through a particular prokaryotic 

15 or eukaryotic cell membrane. Other recombinant constructions may join SRT to nucleotide 
sequence encoding a polypeptide domain which will facilitate protein purification (Kroll et al., 
DNA Cell Biol. 12:441-53 (1993). 

Other useful fragments of the SRT nucleic acids include antisense or sense 
oligonucleotides comprising a singe-stranded nucleic acid sequence (either RNA or DNA) 

20 capable of binding to target SRT mRNA (sense) or SRT DNA (antisense) sequences. Antisense 
or sense oligonucleotides, according to the present invention, comprise a fragment of the coding 
region of SRT DNA. Such a fragment generally comprises at least about 14 nucleotides, 
preferably from about 14 to 30 nucleotides. The ability to derive an antisense or a sense 
oligonucleotide, based upon a cDNA sequence encoding a given protein is described in, for 

25 example, Stein and Cohen ( Cancer Res. 48:2659, 1988) and van der Krol et al. ( BioTechniques 
6:958, 1988). 

Binding of antisense or sense oligonucleotides to target nucleic acid sequences results in 
the formation of duplexes that block transcription or translation of the target sequence by one of 
several means, including enhanced degradation of the duplexes, premature termination of 
30 transcription or translation, or by other means. The antisense oligonucleotides thus may be used 
to block expression of SRT proteins. Antisense or sense oligonucleotides further comprise 
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oligonucleotides having modified sugar-phosphodiester backbones (or other sugar linkages, such 
as those described in WO 9 1 /06629) and wherein such sugar linkages arc resistant to endogenous 
nucleases. Such oligonucleotides with resistant sugar linkages are stable /'// vivo (i.e., capable of 
resisting enzymatic degradation) but retain sequence specificity to be able to bind to target 
nucleotide sequences. 

5 Other examples of sense or antisense oligonucleotides include those oligonucleotides 

which are covaienlly linked to organic moieties, such as those described in WO 90/10048, and 
other moieties that increases affinity of the oligonucleotide for a target nucleic acid sequence, 
such as poly-(L-lysine). Further still, intercalating agents, such as ellipticine, and alkylating 
agents or metal complexes may be attached to sense or antisense oligonucleotides to modify 

1 0 binding specificities of the antisense or sense oligonucleotide for the target nucleotide sequence. 

Antisense or sense oligonucleotides may be introduced into a cell containing the target 
nucleic acid sequence by any gene transfer method, including, for example, CaP0 4 -mediated 
DNA transfection, electroporation, or by using gene transfer vectors such as Epstcin-Barr virus. 
In a preferred procedure, an antisense or sense oligonucleotide is inserted into a suitable 

15 retroviral vector. A cell containing the target nucleic acid sequence is contacted with the 
recombinant retroviral vector, either in vivo or ex vivo. Suitable retroviral vectors include, but 
arc not limited to, those derived from the murine retrovirus M-MuLV, N2 (a retrovirus derived 
from M-MuLV), or the double copy vectors designated DCT5A, DCT5B and DCT5C (see WO 
90/13641). 

20 Sense or antisense oligonucleotides also may be introduced into a cell containing the 

target nucleotide sequence by formation of a conjugate with a ligand binding molecule, as 
described in WO 91/04753. Suitable ligand binding molecules include, but arc not limited to, 
cell surface receptors, growth factors, other cytokines, or other ligands that bind to cell surface 
receptors. Preferably, conjugation of the ligand binding molecule does not substantially interfere 

25 with the ability of the ligand binding molecule to bind to its corresponding molecule or receptor, 
or block entry of the sense or antisense oligonucleotide or its conjugated version into the cell. 

Alternatively, a sense or an antisense oligonucleotide may be introduced into a cell 
containing the target nucleic acid sequence by formation of an oligonucleotide-lipid complex, 
as described in WO 90/10448. The sense or antisense oligonucleotide-lipid complex is 

30 preferably dissociated within the cell by an endogenous lipase. 

The probes may also be employed in PCR techniques to generate a pool of sequences for 
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identification of closely related SRT coding sequences. 

Nucleotide sequences encoding an SRT can also be used to construct hybridization probes 
for mapping the gene which encodes that SRT and for the genetic analysis of individuals with 
genetic disorders. The nucleotide sequences provided herein may be mapped to a chromosome 
and specific regions of a chromosome using known techniques, such as /'// situ hybridization, 
5 linkage analysis against known chromosomal markers, and hybridization screening with libraries. 

When the coding sequences for SRT encode a protein which binds to another protein 
(example, where the SRT is a receptor), the SRT can be used in assays to identify the other 
proteins or molecules involved in the binding interaction. By such methods, inhibitors of the 
receptor/ligand binding interaction can be identified. Proteins involved in such binding 

1 0 interactions can also be used to screen for peptide or small molecule inhibitors or agonists of the 
binding interaction. Also, the receptor SRT can be used to isolate correlative ligand(s). 
Screening assays can be designed to find lead compounds that mimic the biological activity of 
a native SRT or a receptor for SRT. Such screening assays will include assays amenable to high- 
throughput screening of chemical libraries, making them particularly suitable for identifying 

15 small molecule drug candidates. Small molecules contemplated include synthetic organic or 
inorganic compounds. The assays can be performed in a variety of formats, including protein- 
protein binding assays, biochemical screening assays, immunoassays and cell based assays, 
which are well characterized in the art. 

Nucleic acids which encode SRT or its modified forms can also be used to generate either 

20 transgenic animals or "knock out" animals which, in turn, are useful in the development and 
screening of therapeutically useful reagents. A transgenic animal (e.g., a mouse or rat) is an 
animal having cells that contain a transgene, which transgenc was introduced into the animal or 
an ancestor of the animal at a prenatal, e.g., an embryonic stage. A transgene is a DNA which 
is integrated into the genome of a cell from which a transgenic animal develops. In one 

25 embodiment, cDNA encoding SRT can be used to clone genomic DNA encoding SRT in 
accordance with established techniques and the genomic sequences used to generate transgenic 
animals that contain cells which express DNA encoding SRT. Methods for generating transgenic 
animals, particularly animals such as mice or rats, have become conventional in the art and are 
described, for example, in U.S. Patent Nos. 4,736,866 and 4,870,009. Typically, particular cells 

30 would be targeted for SRT transgene incorporation with tissue-specific enhancers. Transgenic 
animals that include a copy of a transgene encoding SRT introduced into the germ line of the 
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animal at an embryonic stage can be used to examine the effect of increased expression of DNA 
encoding SRT. Such animals can be used as tester animals for reagents thought to confer 
protection from, for example, pathological conditions associated with its overexpression. In 
accordance with this facet of the invention, an animal is treated with the reagent and a reduced 
incidence of the pathological condition, compared to untreated animals bearing the transgene, 
5 would indicate a potential therapeutic intervention for the pathological condition. 

Alternatively, non-human homologucs of SRT can be used to construct a SRT "knock 
out" animal which has a defective or altered gene encoding SRT as a result of homologous 
recombination between the endogenous gene encoding SRT and altered genomic DNA encoding 
SRT introduced into an embryonic stem cell of the animal. For example, cDNA encoding SRT 

10 can be used to clone genomic DNA encoding SRT in accordance with established techniques. 
A portion of the genomic DNA encoding SRT can be deleted or replaced with another gene, such 
as a gene encoding a selectable marker which can be used to monitor integration. Typically, 
several kilobases of unaltered Hanking DNA (both at the 5'and 3'ends) are included in the vector 
(see e.g., Thomas and Capecehi, Cell , 51:503 (1987) for a description of homologous 

15 recombination vectors]. The vector is introduced into an embryonic stem cell line (e.g., by 
electroporation) and cells in which the introduced DNA has homologously recombined with the 
endogenous DNA are selected [see e.g., Li et ah, Cell , 69:915 (1992)]. The selected cells are 
then injected into a blastocyst of an animal (e.g., a mouse or rat) to form aggregation chimeras 
[see e.g., Bradley, in Teratocarcinomas and Embryonic Stem Cells: A Practical Approach, E. 

20 J. Robertson, ed. (IRL, Oxford, 1987), pp. 1 1 3-152]. A chimeric embryo can then be implanted 
into a suitable pseudopregnant female foster animal and the embryo brought to term to create a 
"knock out" animal. Progeny harboring the homologously recombined DNA in their germ cells 
can be identified by standard techniques and used to breed animals in which all cells of the 
animal contain the homologously recombined DNA. Knockout animals can be characterized for 

25 instance, for their ability to defend against certain pathological conditions and for their 
development of pathological conditions due to absence of the SRT polypeptide. 

Nucleic acid encoding the SRT polypeptides may also be used in gene therapy. In gene 
therapy applications, genes are introduced into cells in order to achieve in vivo synthesis of a 
therapeutically effective genetic product, for example for replacement of a defective gene. "Gene 

30 therapy" includes both conventional gene therapy where a lasting effect is achieved by a single 
treatment, and the administration of gene therapeutic agents, which involves the one time or 
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repeated administration of a therapeutically effective DNA or mRNA. Antisense RNAs and 
DNAs can be used as therapeutic agents for blocking the expression of certain genes in vivo. It 
has already been shown that short antisense oligonucleotides can be imported into cells where 
they act as inhibitors, despite their low intracellular concentrations caused by their restricted 
uptake by the cell membrane. (Zamecnik et ah, Proc. Natl. Acad. Sci. USA 83:4143-4146 
5 [ 1986]). The oligonucleotides can be modified to enhance their uptake, e.g. by substituting their 
negatively charged phosphodiester groups by uncharged groups. 

There are a variety of techniques available for introducing nucleic acids into viable cells. 
The techniques vary depending upon whether the nucleic acid is transferred into cultured cells 
in vitro* or in vivo in the cells of the intended host. Techniques suitable for the transfer of nucleic 

1 0 acid into mammalian cells in vitro include the use of liposomes, elcctroporation, microinjection, 
cell fusion. DEAE-dcxtran, the calcium phosphate precipitation method, etc. The currently 
preferred /'// vivo gene transfer techniques include transfection with viral (typically retroviral) 
vectors and viral coat protein- liposome mediated transfection (Dzau et al., Trends in 
Biotechnology 1 1, 205-2 10 [ 1993]). In some situations it is desirable to provide the nucleic acid 

1 5 source with an agent that targets the target cells, such as an antibody specific for a cell surface 
membrane protein or the target cell, a ligand for a receptor on the target cell, etc. Where 
liposomes are employed, proteins which bind to a cell surface membrane protein associated with 
endocytosis may be used for targeting and/or to facilitate uptake, e.g. capsid proteins or 
fragments thereof tropic for a particular cell type, antibodies for proteins which undergo 

20 internalization in cycling, proteins that target intracellular localization and enhance intracellular 
half-life. The technique of receptor-mediated endocytosis is described, for example, by Wu et 
ah, J. Biol. Chcm. 262, 4429-4432 (1987); and Wagner et al., Proc. Natl. Acad. Sci. USA 87, 
3410-3414 (1990). For review of gene marking and gene therapy protocols see Anderson et al., 
Science 256, 808-8 13 (1 992). 

25 The SRT polypeptides described herein may also be employed as molecular weight 

markers for protein electrophoresis purposes. 

The nucleic acid molecules encoding the SRT polypeptides or fragments thereof 
described herein are useful for chromosome identification. In this regard, there exists an ongoing 
need to identify new chromosome markers, since relatively few chromosome marking reagents, 

30 based upon actual sequence data are presently available. Each SRT nucleic acid molecule of the 
present invention can be used as a chromosome marker. 
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The SRT polypeptides and nucleic aeid molecules of the present invention may also be 
used for tissue typing, wherein the SRT polypeptides of the present invention may be 
differentially expressed in one tissue as compared to another, for example in a diseased tissue 
versus a normal tissue. SRT nucleic acid molecules will find use for generating probes for PCR. 
Northern analysis. Southern analysis and Western analysis. 
5 The SRT polypeptides described herein and antibodies thereagainst may also be employed 

as therapeutic agents. The SRT polypeptides of the present invention can be formulated 
according to known methods to prepare pharmaceutically useful compositions, whereby the SRT 
product hereof is combined in admixture with a pharmaceutically acceptable carrier vehicle. 
Therapeutic formulations are prepared for storage by mixing the active ingredient having the 

1 0 desired degree of purity with optional physiologically acceptable carriers, excipients or stabilizers 
( Remingtons Pharmaceutical Sciences 16th edition. Osol, A. Ed. (1980)), in the form of 
lyophilized formulations or aqueous solutions. Acceptable carriers, excipients or stabilizers are 
nontoxic to recipients at the dosages and concentrations employed, and include buffers such as 
phosphate, citrate and other organic acids; antioxidants including ascorbic acid; low molecular 

15 weight (less than about 10 residues) polypeptides; proteins, such as serum albumin, gelatin or 
immunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone, amino acids such as 
glycine, glutamine, asparagine, arginine or lysine; monosaccharides, disaccharides and other 
carbohydrates including glucose, mannose, or dextrins; chelating agents such as EDTA; sugar 
alcohols such as mannitol or sorbitol; salt-forming counterions such as sodium; and/or nonionic 

20 surfactants such as TWEEN™, PLURONICS™ or PEG. 

The formulations to be used for /// vivo administration must be sterile. This is readily 
accomplished by filtration through sterile filtration membranes, prior to or following 
lyophilization and reconstitution. 

Therapeutic compositions herein generally arc placed into a container having a sterile 

25 access port, for example, an intravenous solution bag or vial having a stopper pierceable by a 
hypodermic injection needle. 

The route of administration is in accord with known methods, e.g. injection or infusion 
by intravenous, intraperitoneal, intracerebral, intramuscular, intraocular, intraarterial or 
intralesional routes, topical administration, or by sustained release systems. 

30 Dosages and desired drug concentrations of pharmaceutical compositions of the present 

invention may vary depending on the particular use envisioned. The determination of the 
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appropriate dosage or route of administration is well within the skill of an ordinary physician. 
Animal experiments provide reliable guidance for the determination of effective doses for human 
therapy. Interspecies scaling of effective doses can be performed following the principles laid 
down by Mordenti, J. and Chappcll, W. "The use of interspecies scaling in toxicokinetics" In 
Toxicokinetics and New Drug Development. Yacobi et al., Eds., Pergamon Press, New York 
5 1989, pp. 42-96. 

When in vivo administration of a SRT polypeptide or agonist or antagonist thereof is 
employed, normal dosage amounts may vary from about 1 0 ng/kg to up to 1 00 mg/kg of mammal 
body weight or more per day, preferably about 1 ug/kg/day to 10 mg/kg/day, depending upon the 
route of administration. Guidance as to particular dosages and methods of delivery is provided 

10 in the literature; see, for example, U.S. Pat. Nos. 4,657,760; 5.206.344; or 5,225,212. It is 
anticipated that different formulations will be effective for different treatment compounds and 
different disorders, that administration targeting one organ or tissue, for example, may necessitate 
delivery in a manner different from that to another organ or tissue. 

Where sustained-release administration of a SRT polypeptide is desired in a formulation 

15 with release characteristics suitable for the treatment of any disease or disorder requiring 
administration of the SRT polypeptide, microencapsulation of the SRT polypeptide is 
contemplated. Microencapsulation of recombinant proteins for sustained release has been 
successfully performed with human growth hormone (rhGH), interferon- (rhIFN- ), interleukin-2, 
and MN rgp 1 20. Johnson et al., Nat. Med. . 2:795-799 ( 1 996); Yasuda, Biomed. Ther. , 27:1221- 

20 1223 (1993); Hora et al.. Bi o/Technolog y. 8:755-758 (1990); Cleland, "Design and Production 
of Single Immunization Vaccines Using Polylactide Polyglycolide Microsphere Systems," in 
Vaccine Design: The Subunit and Adjuvant Approach , Powell and Newman, eds, (Plenum Press: 
New York, 1995), pp. 439-462; WO 97/03692, WO 96/40072, WO 96/07399: and U.S. Pat. No. 
5,654,010. 

25 The sustained-release formulations of these proteins were developed using poly-lactic- 

coglycolic acid (PLGA) polymer due to its biocompatibility and wide range of biodegradable 
properties. The degradation products of PLGA. lactic and glycolic acids, can be cleared quickly 
within the human body. Moreover, the degradability of this polymer can be adjusted from 
months to years depending on its molecular weight and composition. Lewis, "Controlled release 

30 of bioactive agents from lactide/glycolide polymer," in: M. Chasin and R. Langer (Eds.), 
Biodegradable Polymers as Drug Delivery Systems (Marcel Dekker: New York, 1 990), pp. 1-41. 
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This invention encompasses methods of screening compounds to identify those that 
mimic the SRT polypeptide (agonists) or prevent the effect of the SRT polypeptide (antagonists). 
Screening assays for antagonist drug candidates are designed to identify compounds that bind or 
complex with the SRT polypeptides encoded by the genes identified herein, or otherwise interfere 
with the interaction of the encoded polypeptides with other cellular proteins. Such screening 
5 assays will include assays amenable to high-throughput screening of chemical libraries, making 
them particularly suitable for identifying small molecule drug candidates. 

The assays can be performed in a variety of formats, including protein-protein binding 
assays, biochemical screening assays, immunoassays, and cell-based assays, which are well 
characterized in the art. 

1 0 All assays for antagonists are common in that they call for contacting the drug candidate 

with a SRT polypeptide encoded by a nucleic acid identified herein under conditions and for a 
time sufficient to allow these two components to interact. 

In binding assays, the interaction is binding and the complex formed can be isolated or 
detected in the reaction mixture. In a particular embodiment, the SRT polypeptide encoded by 

15 the gene identified herein or the drug candidate is immobilized on a solid phase, e.g., on a 
microliter plate, by covalent or non-covalent attachments. Non-covalent attachment generally 
is accomplished by coating the solid surface with a solution of the SRT polypeptide and drying. 
Alternatively, an immobilized antibody, e.g., a monoclonal antibody, specific for the SRT 
polypeptide to be immobilized can be used to anchor it to a solid surface. The assay is performed 

20 by adding the non-immobilized component, which may be labeled by a detectable label, to the 
immobilized component, e.g., the coated surface containing the anchored component. When the 
reaction is complete, the non-reacted components are removed, e.g., by washing, and complexes 
anchored on the solid surface are detected. When the originally non-immobilized component 
carries a detectable label, the detection of label immobilized on the surface indicates that 

25 complexing occurred. Where the originally non-immobilized component does not carry a label, 
complexing can be detected, for example, by using a labeled antibody specifically binding the 
immobilized complex. 

If the candidate compound interacts with but does not bind to a particular SRT 
polypeptide encoded by a gene identified herein, its interaction with that polypeptide can be 

30 assayed by methods well known for detecting protein-protein interactions. Such assays include 
traditional approaches, such as, e.g., cross-linking, co-inimunopreeipitation, and co-purification 
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through gradients or chromatographic columns. In addition, protein-protein interactions can be 
monitored by using a yeast-based genetic system described by Fields and co-workers (Fields and 
Song, Nature (London) . 340:245-246 ( 1 989); Chien et al.. Proc. Natl. Acad. Sci. USA . 88:9578- 
9582 ( 1991 )) as disclosed by Chevray and Nathans, Proc. Natl. Acad. Sci. USA . 89: 5789-5793 
( 1 99 1 ). Many transcriptional activators, such as yeast GAL4. consist of two physically discrete 
5 modular domains, one acting as the DNA-binding domain, the other one functioning as the 
transcription-activation domain. The yeast expression system described in the foregoing 
publications (generally referred to as the "two-hybrid system") takes advantage of this property, 
and employs two hybrid proteins, one in which the target protein is fused to the DNA-binding 
domain of GAL4, and another, in which candidate activating proteins are fused to the activation 

10 domain. The expression of a GALl/^/rZ reporter gene under control of a GAL4-activated 
promoter depends on reconstitution of GAL4 activity via protein-protein interaction. Colonies 
containing interacting polypeptides are detected with a chromogenic substrate for (3- 
galactosidase. A complete kit (MATCHMAKER™) for identifying protein-protein interactions 
between two specific proteins using the two hybrid technique is commercially available from 

1 5 Clontech. This system can also be extended to map protein domains involved in specific protein 
interactions as well as to pinpoint amino acid residues that are crucial for these interactions. 

Compounds that interfere with the interaction of a gene encoding a SRT polypeptide 
identified herein and other intra- or extracellular components can be tested as follows: usually 
a reaction mixture is prepared containing the product of the gene and the intra- or extracellular 

20 component under conditions and for a time allowing for the interaction and binding of the two 
products. To test the ability of a candidate compound to inhibit binding, the reaction is run in 
the absence and in the presence of the test compound. In addition, a placebo may be added to 
a third reaction mixture, to serve as positive control. The binding (complex formation) between 
the test compound and the intra- or extracellular component present in the mixture is monitored 

25 as described hereinabove. The formation of a complex in the control reaction(s) but not in the 
reaction mixture containing the test compound indicates that the test compound interferes with 
the interaction of the test compound and its reaction partner. 

To assay for antagonists, the SRT polypeptide may be added to a cell along with the 
compound to be screened for a particular activity and the ability of the compound to inhibit the 

30 activity of interest in the presence of the SRT polypeptide indicates that the compound is an 
antagonist to the SRT polypeptide. Alternatively, antagonists may be detected by combining the 
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SRT polypeptide and a potential antagonist with membrane-bound SRT polypeptide reeeptors 
or recombinant reeeptors under appropriate conditions for a competitive inhibition assay. The 
SRT polypeptide can be labeled, such as by radioactivity, such that the number of SRT 
polypeptide molecules bound to the receptor can be used to determine the effectiveness of the 
potential antagonist. The gene encoding the receptor can be identified by numerous methods 
5 known to those of skill in the art, for example, ligand panning and FACS sorting. Coligan et al., 
Current Protocols in Immun. , 1(2): Chapter 5 ( 1991 ). Preferably, expression cloning is employed 
wherein polyadenylated RNA is prepared from a cell responsive to the SRT polypeptide and a 
cDNA library created from this RNA is divided into pools and used to transfect COS cells or 
other cells that are not responsive to the SRT polypeptide. Transfected cells that are grown on 

1 0 glass slides are exposed to labeled SRT polypeptide. The SRT polypeptide can be labeled by a 
variety of means including iodination or inclusion of a recognition site for a site-specific protein 
kinase. Following fixation and incubation, the slides are subjected to autoradiographic analysis. 
Positive pools are identified and sub-pools arc prepared and re-transfecled using an interactive 
sub-pooling and re-screening process, eventually yielding a single clone that encodes the putative 

1 5 receptor. 

As an alternative approach for receptor identification, labeled SRT polypeptide can be 
photoaffinity-linked with cell membrane or extract preparations that express the receptor 
molecule. Cross-linked material is resolved by PAGE and exposed to X-ray film. The labeled 
complex containing the receptor can be excised, resolved into peptide fragments, and subjected 

20 to protein micro-sequencing. The amino acid sequence obtained from micro- sequencing would 
be used to design a set of degenerate oligonucleotide probes to screen a cDNA library to identify 
the gene encoding the putative receptor. 

In another assay for antagonists, mammalian cells or a membrane preparation expressing 
the receptor would be incubated with labeled SRT polypeptide in the presence of the candidate 

25 compound. The ability of the compound to enhance or block this interaction could then be 
measured. 

More specific examples of potential antagonists include an oligonucleotide that binds to 
the fusions of immunoglobulin with SRT polypeptide, and, in particular, antibodies including, 
without limitation, poly- and monoclonal antibodies and antibody fragments, single-chain 
30 antibodies, anti-idiotypic antibodies, and chimeric or humanized versions of such antibodies or 
fragments, as well as human antibodies and antibody fragments. Alternatively, a potential 
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antagonist may be a closely related protein, for example, a mutated form of the SRT polypeptide 
that recognizes the receptor but imparts no effect, thereby competitively inhibiting the action of 
the SRT polypeptide. 

Another potential SRT polypeptide antagonist is an untisense RNA or DNA construct 
prepared using ant i sense technology, where, e.g., an untisense RNA or DNA molecule acts to 
5 block directly the translation of mRNA by hybridizing to targeted mRNA and preventing protein 
translation. Anlisense technology can be used to control gene expression through triple-helix 
formation or anlisense DNA or RNA, both of which methods are based on binding of a 
polynucleotide to DNA or RNA. For example, the 5' coding portion of the polynucleotide 
sequence, which encodes the mature SRT polypeptides herein, is used to design an anlisense 

10 RNA oligonucleotide of from about 10 to 40 base pairs in length. A DNA oligonucleotide is 
designed to be complementary to a region of the gene involved in transcription (triple helix - see 
Lee ct aL Nucl. Acids Res. , 6:3073 (1979); Cooney el al., Science , 241 : 456 (1988); Dervan et 
al.. Science , 25 1 : 1 360 (1991 )), thereby preventing transcription and the production of the SRT 
polypeptide. The antisense RNA oligonucleotide hybridizes to the mRNA in vivo and blocks 

1 5 translation of the mRNA molecule into the SRT polypeptide (antisense - Okano, Neurochem, . 
56:560 ( 1 99 1 ); Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression (CRC Press: 
Boca Raton, FL, 1 988). The oligonucleotides described above can also be delivered to cells such 
that the antisense RNA or DNA may be expressed in vivo to inhibit production of the SRT 
polypeptide. When antisense DNA is used, oligodeoxyribonucleotidcs derived from the 

20 translation-initiation site, e.g., between about - 10 and +1 0 positions of the target gene nucleotide 
sequence, are preferred. 

Potential antagonists include small molecules that bind to the active site, the receptor 
binding site, or growth factor or other relevant binding site of the SRT polypeptide, thereby 
blocking the normal biological activity of the SRT polypeptide. Examples of small molecules 

25 include, but are not limited to, small peptides or peptidc-like molecules, preferably soluble 
peptides, and synthetic non-peptidyl organic or inorganic compounds. 

Ribozymes are enzymatic RNA molecules capable of catalyzing the specific cleavage of 
RNA. Ribozymes act by sequence-specific hybridization to the complementary target RNA, 
followed by cndonucleolytic cleavage. Specific ribozymc cleavage sites within a potential RNA 

30 target can be identified by known techniques. For further details see, e.g., Rossi, Current 
Biology , 4:469-471 (1994), and PCT publication No. WO 97/33551 (published September 18, 
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1997). 

Nucleic acid molecules in triple-helix formation used to inhibit transcription should be 
single-stranded and composed of deoxynucleotidcs. The base composition of these 
oligonucleotides is designed such that it promotes triple-helix formation via Hoogsteen base- 
pairing rules, which generally require sizeable stretches of purines or pyrimidincs on one strand 
5 of a duplex. For further details see, e.g., PCT publication No. WO 97/3355 1 , supra. 

These small molecules can be identified by any one or more of the screening assays 
discussed hereinabove and/or by any other screening techniques well known for those skilled in 
the art. 

10 F. Anti-SRT Polypeptide Antibodies 

The present invention further provides anti-SRT antibodies. Exemplary antibodies 
include polyclonal, monoclonal, humanized, bispecific, and heteroconjugate antibodies. 

1 . Polyclonal Antibodies 

15 The anti-SRT antibodies may comprise polyclonal antibodies. Methods of preparing 

polyclonal antibodies are known to the skilled artisan. Polyclonal antibodies can be raised in a 
mammal, for example, by one or more injections of an immunizing agent and, if desired, an 
adjuvant. Typically, the immunizing agent and/or adjuvant will be injected in the mammal by 
multiple subcutaneous or intraperitoneal injections. The immunizing agent may include the SRT 

20 polypeptide or a fusion protein thereof. It may be useful to conjugate the immunizing agent to 
a protein known to be immunogenic in the mammal being immunized. Examples of such 
immunogenic proteins include but are not limited to keyhole limpet hemocyanin, serum albumin, 
bovine thyroglobulin, and soybean trypsin inhibitor. Examples of adjuvants which may be 
employed include Freund s complete adjuvant and MPL-TDM adjuvant (monophosphoryl Lipid 

25 A, synthetic trehalose dicorynomycolate). The immunization protocol may be selected by one 
skilled in the art without undue experimentation. 

2. Monoclonal Antibodies 

The anti-SRT antibodies may, alternatively, be monoclonal antibodies. Monoclonal 
30 antibodies may be prepared using hybridoma methods, such as those described by Kohlcr and 
Milstcin, Nature, 256 :495 (1 975). In a hybridoma method, a mouse, hamster, or other 
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appropriate host animal, is typically immunized with an immunizing agent to elicit lymphocytes 
that produce or are capable of producing antibodies that will specifically bind to the immunizing 
agent. Alternatively, the lymphocytes may be immunized in vitro. 

The immunizing agent will typically include the SRT polypeptide or a fusion protein 
thereof. Generally, either peripheral blood lymphoey.es ("PBLs") are used if cells of human 
origin are desired, or spleen cells or lymph node cells are used if non-human mammalian sources 
are'desired. The lymphocytes are then fused with an immortalized cell line using a suitable 
fusing agent, such as polyethylene glycol, to torn, a hybridoma cell [Coding, Monoclonal 
A,„ i h,vli,>,Pnn lin lesand Practice. Academic Press. ( 1 986, pp. 59- 1 03 ]. Immortalized cell fines 
are usually transformed mammalian cells, particularly myeloma cells of rodent, bovine and 
human origin. Usually, rat or mouse myeloma cell fines are employed. The hybridoma cells may 
be cultured in a suitable culture medium that preferably contains one or more substances that 
inhibit the growth or survival of the unfused. immortalized cells. For example, if the parental 
eells lack the enzyme hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT). the 
culture medium for the hybndomas typically will include hypoxanthine. aminopterin, and 
thymidine ("HAT medium"), which substances prevent the growth of HGPRT-dcficicnt cells. 

Preferred immortalized cell lines are those that fuse efficiently, support stable high level 
expression of antibody by the selected antibody-producing cells, and are sensitive to a medium 
such as HAT medium. More preferred immortalized cell lines are murine myeloma lines, which 
can be obtained, for instance, from the Salk Institute Cell Distribution Center, San Diego, 
California and the American Type Culture Collection. Manassas. Virginia. Human myeloma and 
mouse-human he.eromyeloma cell lines also have been described for the production of human 
monoclonal antibodies IKozbor, TJmmunoL- 133:3001 (.984); Brodeur e, al.. M^oclonal 
Antibodv. Production Techniques and Applicatio ns. Marcel Deleter. Inc., New York. (1987) pp. 
51-63]. 

The culture medium in which the hybridoma cells are cultured can then be assayed for 
the presence of monoclonal antibodies directed against SRT. Preferably, the binding specificity 
of monoclonal antibodies produced by the hybridoma cells is determined by immunoprecpi.ation 
or by an in vitro binding assay, such as radioimmunoassay (RIA) or enzyme-linked 
immunoabsorbent assay (ELISA). Such techniques and assays are known in the art. The binding 
affinity of the monoclonal antibody can. for example, be determined by the Scatchard analysis 
of Munson and Pollard. Anal. Biochem. . 107:220 (1980). 
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After the desired hybridoma eel Is are identified, the clones may be subcloned by limiting 
dilution procedures and grown by standard methods I Coding, supra ]. Suitable culture media for 
this purpose include, for example, Dulbeccos Modified Eagle's Medium and RPMI-1040 
medium. Alternatively, the hybridoma cells may be grown in vivo as ascites in a mammal. 

The monoclonal antibodies secreted by the subclones may be isolated or purified from 
5 the culture medium or ascites fluid by conventional immunoglobulin purification procedures such 
as. for example, protein A-Sepharose, hydroxylapatite chromatography, gel electrophoresis, 
dialysis, or affinity chromatography. 

The monoclonal antibodies may also be made by recombinant DNA methods, such as 
those described in U.S. Patent No. 4,8 1 6,567. DNA encoding the monoclonal antibodies of the 

10 invention can be readily isolated and sequenced using conventional procedures (e.g., by using 
oligonucleotide probes that are capable of binding specifically to genes encoding the heavy and 
light chains of murine antibodies). The hybridoma cells of the invention serve as a preferred 
source of such DNA. Once isolated, the DNA may be placed into expression vectors, which are 
then transfected into host cells such as simian COS cells, Chinese hamster ovary (CHO) cells, 

1 5 or myeloma cells that do not otherwise produce immunoglobulin protein, to obtain the synthesis 
of monoclonal antibodies in the recombinant host cells. The DNA also may be modified, for 
example, by substituting the coding sequence for human heavy and light chain constant domains 
in place of the homologous murine sequences [U.S. Patent No. 4.8 16,567; Morrison el al., supra] 
or by covalcntly joining to the immunoglobulin coding sequence all or part of the coding 

20 sequence for a non-immunoglobulin polypeptide. Such a non-immunoglobulin polypeptide can 
be substituted for the constant domains of an antibody of the invention, or can be substituted for 
the variable domains of one antigen-combining site of an antibody of the invention to create a 
chimeric bivalent antibody. 

The antibodies may be monovalent antibodies. Methods for preparing monovalent 

25 antibodies are well known in the art. For example, one method involves recombinant expression 
of immunoglobulin light chain and modified heavy chain. The heavy chain is truncated generally 
at any point in the Fc region so as to prevent heavy chain crosslinking. Alternatively, the relevant 
cysteine residues are substituted with another amino acid residue or are deleted so as to prevent 
crosslinking. 

30 /// vitro methods are also suitable for preparing monovalent antibodies. Digestion of 

antibodies to produce fragments thereof, particularly. Fab fragments, can be accomplished using 
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routine techniques known in the art. 



3. Human and Humanize d Antibodies 
The anti-SRT antibodies of the invention may further comprise humanized antibodies or 
human antibodies. Humanized forms of non-human (e.g.. murine) antibodies are chimeric 
5 immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, Fab. Fab'. F(ab') : 
or other antigen-binding subsequences of antibodies) which eontain minimal sequence derived 
front non-human immunoglobulin. Humanized antibod.es include human immunoglobulins 
(recipient antibody) in which residues from a complementary determining region (CDR) of the 
recipient are replaced by residues from a CDR of a non-human species (donor antibody) such as 
1 0 mouse, rat or rabbit having the desired specificity, affinity and capacity. In some instances. Fv 
framework residues of the human immunoglobulin are replaced by corresponding non-human 
residues. Humanized antibodies may also comprise residues which arc found neither in the 
recipient antibody nor in the imported CDR or framework sequences. In general, the humanized 
antibody will comprise substantially all of at least one. and typically two. variable domains, in 
15 which all or substantially all of the CDR regions correspond to those of a non-human 
immunoglobulin and all or substantially all of the FR regions are those of a human 
immunoglobulin consensus sequence. The humanized antibody optimally also will comprise at 
least a portion of an immunoglobulin constant region (Fc), typically that of a human 
immunoglobulin (Jones et al„ Nature, 321:522-525 (1986); Ricchmann et al.. Nature. 332:323- 
20 329 (1988): and Presta, Curr Op. Struct. Biol.. 2:593-596(1992)]. 

Methods for humanizing non-human antibodies arc well known in the art. Generally, a 
humanized antibody has one or more amino acid residues introduced into it from a source which 
is non-human. These non-human ammo acid residues are often referred to as "import" residues, 
which are typically taken from an "import" variable domain. Humanization can be essentially 
25 performed following the method of Winter and co-workers [Jones et al., Njiture, 321:522-525 
( 1 986): Riechmann e, al.. Nature. 332:323-327 ( 1 988); Verhoeyen et al., Science, 239: 1 534- 1 536 
(1988)1. by substituting rodent CDRs or CDR sequences for the corresponding sequences of a 
human antibody. Accordingly, such "humanized" antibodies are chimeric antibodies (U.S. Patent 
No. 4.816.567). wherein substantially less than an intact human variable domain has been 
30 substituted by the corresponding sequence from a non-human species. In practice, humanized 
antibodies are typically human antibodies in which some CDR residues and possibly some FR 
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residues are substituted by residues from analogous sites in rodent antibodies. 

Human antibodies can also be produced using various techniques known in the art. 
including phage display libraries IHoogenboom and Winter. J. Mol. Bio h. 227:381 (1991): 
Marks el al.. .1. Mol. Biol. . 222:58 1 (1991 )]. The techniques of Cole el al. and Boerneret al. are 
also available for the preparation of human monoclonal antibodies (Cole et al.. Monoclonal 
5 Antibodies and Cancer Therap y. Alan R. lass. p. 77 (1985) and Boerner e. al.. J. Immunol. . 
147(1 ) :86-95 (1991)]. Similarly, human antibodies can be made by introducing of human 
immunoglobulin loci into transgenic animals, e.g.. mice in which the endogenous 
immunoglobulin genes have been partially or completely inactivated. Upon challenge, human 
antibody production is observed, which closely resembles .hat seen in humans in all respects. 
1 0 including gene rearrangement, assembly, and antibody repertoire. This approach is described, 
for example, in U.S. Patent Nos. 5.545.807; 5,545.806; 5.569.825; 5.625.126; 5,633,425; 
5.661,016, and in the following scientific publications: Marks el al.. Rio/Technology K). 779- 
783 ( 1 992): Lonberg et al.. Nature 368 856-859 ( 1 994); Morrison. Nature 368.812-13 (1 994): 
Fishwild et al., Nature Riotcchnology J4. 845-5 1 ( 1 996); Neuberger. Nature Biotechnology J4, 
15 826(1996): Lonberg and Hus/,ar intern Rev. Immunol. 1 3 65-93 (1995). 

4. Bispecific Antibodies 
Bispecific antibodies are monoclonal, preferably human or humanized, antibodies thai 
have binding specificities for at least two different antigens. In the present case, one of the 
20 binding specificities is for the SRT, the other one is for any other antigen, and preferably for a 
cell-surface protein or receptor or receptor subunit. 

Methods for making bispecific antibodies are known in the art. Traditionally, the 
recombinant production of bispecific antibodies is based on the co-expression of two 
immunoglobulin heavy-chain/light-chain pairs, where the two heavy chains have different 
25 specificities [Milstein and Cucllo. Nature. 305:537-539 (1983)]. Because of the random 
assortment of immunoglobulin heavy and light chains, these hybridomas (quadromas) produce 
a potential mixture of ten different antibody molecules, of which only one has the correct 
bispecific structure. The purification of the correct molecule is usually accomplished by affinity 
chromatography steps. Similar procedures are disclosed in WO 93/08829. published 13 May 
30 1 993, and in Traunecker et al.. EMBOJ. . 10:3655-3659 ( 1 99 1 ). 

Antibody variable domains with the desired binding specificities (antibody-antigen 
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combining sites) can be fused to immunoglobulin constant domain sequences. The fusion 
preferably is with an immunoglobulin heavy-chain constant domain, comprising at least part of 
the hinge, CH2, and CH3 regions. It is preferred to have the first heavy-chain constant region 
(CH 1 ) containing the site necessary for light-chain binding present in at least one of the fusions. 
DNAs encoding the immunoglobulin heavy-chain fusions and, if desired, the immunoglobulin 
5 light chain, are inserted into separate expression vectors, and are co-transfected into a suitable 
host organism. For further details of generating bi specific antibodies see, for example, Suresh 
et al., Methods in Enzymolog y. 121:210 (1986). 

According to another approach described in WO 96/2701 1 . the interface between a pair 
of antibody molecules can be engineered to maximize the percentage of helerodimcrs which are 

1 0 recovered from recombinant cell culture. The preferred interface comprises at least a part of the 
CH3 region of an antibody constant domain. In this method, one or more small amino acid side 
chains from the interface of the first antibody molecule are replaced with larger side chains (e.g. 
tyrosine or tryptophan). Compensatory "cavities" of identical or similar size to the large side 
chain(s) are created on the interface of the second antibody molecule by replacing large amino 

15 acid side chains with smaller ones (e.g. alanine or threonine). This provides a mechanism for 
increasing the yield of the heterodimer over other unwanted end-products such as homodimcrs. 

Bi specific antibodies can be prepared as full length antibodies or antibody fragments (e.g. 
F(ab*) 2 bispecific antibodies). Techniques for generating bispecific antibodies from antibody 
fragments have been described in the literature. For example, bispecific antibodies can be 

20 prepared can be prepared using chemical linkage. Brennan etal.. Science 229:8 1 ( 1 985) describe 
a procedure wherein intact antibodies are proteolytically cleaved to generate F(ab') : fragments. 
These fragments are reduced in the presence of the dithiol complexing agent sodium arsenite to 
stabilize vicinal dithiols and prevent intermolccular disulfide formation. The Fab* fragments 
generated are then converted to thionitrobenzoate (TNB) derivatives. One of the Fab'-TNB 

25 derivatives is then reconverted to the Fab* -thiol by reduction with mercaptoethylamine and is 
mixed with an equimolar amount of the other Fab'-TNB derivative to form the bispecific 
antibody. The bispecific antibodies produced can be used as agents for the selective 
immobilization of enzymes. 

Fab' fragments may be directly recovered from E. coli and chemically coupled to form 

30 bispecific antibodies. Shalaby et J. Exp. Med. 175:2 17-225 (1992) describe the production 
of a fully humanized bispecific antibody F(ab') : molecule. Each Fab' fragment was separately 
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secreted from /:. coli and subjected to directed chemical coupling in vitro to form the bispecifie 
antibody. The bispecifie antibody thus formed was able to bind to cells overexpressing the 
ErbB2 receptor and normal human T cells, as well as trigger the lytic activity of human cytotoxic 
lymphocytes against human breast tumor targets. 

Various technique for making and isolating bispecifie antibody fragments directly from 
5 recombinant cell culture have also been described. For example, bispecifie antibodies have been 
produced using leucine zippers. Kostclny et al., J. Immunol. 1 48(5): I 547- 1 553 ( 1992). The 
leucine zipper peptides from the Fos and Jun proteins were linked to the Fab' portions of two 
different antibodies by gene fusion. The antibody homodimers were reduced at the hinge region 
to form monomers and then re-oxidized to form the antibody heterodimers. This method can also 

10 be utilized for the production of antibody homodimers. The "diabody" technology described by 
Hollinger et c//., Proc. Natl. Acad. Sci. USA 90:6444-6448 ( 1 993) has provided an alternative 
mechanism for making bispecifie antibody fragments. The fragments comprise a heavy-chain 
variable domain (V (l ) connected to a light-chain variable domain (V, ) by a linker which is too 
short to allow pairing between the two domains on the same chain. Accordingly, the V H and V, 

15 domains of one fragment are forced to pair w ith the complementary V, and V H domains of 
another fragment, thereby forming two antigen-binding sites. Another strategy for making 
bispecifie antibody fragments by the use of single-chain Fv (sFv) dimers has also been reported. 
See, Gruber et at., J. Immunol. 1 52:5368 (1994). 

Antibodies with more than two valencies are contemplated. For example, tri specific antibodies 

20 can be prepared. Tutt et aL, J. Immunol. 147:60 (1991 ). 

Exemplary bispecifie antibodies may bind to two different epitopes on a given SRT 
polypeptide herein. Alternatively, an anti-SRT polypeptide arm may be combined with an arm 
which binds to a triggering molecule on a leukocyte such asaT-cell receptor molecule (e.g. CD2. 
CD3, CD28, or B7). or Fc receptors for IgG ( FcyR), such as FcyRI (CD64), FcyRII (CD32) and 

25 FcyRIII (CD 16) so as to focus cellular defense mechanisms to the cell expressing the particular 
SRT polypeptide. Bispecifie antibodies may also be used to localize cytotoxic agents to cells 
which express a particular SRT polypeptide. These antibodies possess a SRT-binding arm and 
an arm which binds a cytotoxic agent or a radionuclide chelator, such as EOTUBE, DPTA, 
DOTA, orTETA. Another bispecifie antibody of interest binds the SRT polypeptide and further 

30 binds tissue factor (TF). 
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5 Hetcroconiug "^ Antibodies 
Heteroconjugate antibodies are also within the scope of the present invention. 
Hcteroconjusa.e antibodies are composed of two cova.cnt.y joined antibodies. Such antibod.es 
nave for example, been proposed to target immune system cells to unwanted ce.ls [U.S. Patent 
No 4 676 ,80,. and for treatment of HIV infection [WO 9 , /00360: WO 92/200373: BP 03089,. 
„ .s contemplated that the antibodies may be prepared in rUro using known methods in synthcttc 
protein chemistry, including those involving crosslinks agents. For example, immunotoxms 
may be constructed ustng a disulfide exchange reaction or by forming a thioethcr bond. 
Examples of suitable reagents for this purpose include iminothto.atc and methyl-4- 
n.rcaptobutyrimidatc and those disclosed, for example, in U.S. Patent No. 4.676.980. 

6. Effector Function Eng ineering 
„ ^ be desirable to modify the antibody of the invention with respect to effector 
function so as to enhance, c.,. . the effectiveness of the antibody tn treating cancer. For example, 
cysteine residue(s) may be introduced into the Fc region, thereby allowtng interchain d.su.l.dc 
bond formation in this region. The homodimcne antibody thus generated may have improved 
vernalization capability and/or increased complement-mediated cel. killing and antibody- 
dependent ceHular cytotoxicity (ADCQ. SeeCaron el cii, EJ^xr>jyled.. 126: I 191-1 195 (1992) 
,nd Shopes J . Immunol .. 148: 29 1 8-2922 ( 1 992). Homodimenc antibodies with enhanced ant- 
tumor activity may also be prepared ustng heterobtfunctional cross-linkers as described in Wolf. 
eta l Cancer Research. 53: 2560-2565 ,1993). Alternatively, an antibody can be eng.neered that 
has du^^Tnd may thereby have enhanced complement lysis and ADCC capabi.it.es. 
See Stevenson ct a,.. 3: 2.9-230 ( .989). 
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7 immnnoconiuaates 
The invention also pertains to tmmunoconjugates comprising an antibody conjugated to 
a cytotoxic agent such as a chemotherapeutic agent, toxin U,,,. an enzymatiea.ly active toxm ot 
rial fungal, plant, or animal origin, or fragments thereof,, or a radioactive isotope (,.,.. a 



bacten 



radioconjugate). 

Chemotherapeutic agents useful in the generation of such immunoconjugates have been 
described above. Enzymatiea.ly aet.ve toxins and fragments thereof , ha, can be used mcludc 
diphtheria A chain, nonbinding active fragments of diphtheria toxin, exotoxin A cha.n (trom 
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Pseudomonas aeruginosa)* ricin A chain, ubrin A chain, modeccin A chain, alpha-sarein, 
Aleitrites fordii proteins, diunthin proteins, Phyiolaca anwheana proteins (PAPI. PAP1I, and 
PAP S), momordica churantiu inhibitor, curein, crotin, sapaonaria officinalis inhibitor, gelonin, 
mitogcllin. reslrictoein, phenomycin, enomyein, and the tricothcccncs. A variety of radionuclides 
are available for the production of radioconjugated antibodies. Examples include 212 Bi, l31 In. 
5 W Y, and IS{, Re. Conjugates of the antibody and cytotoxic agent are made using a variety 

of Afunctional protein-coupling agents such as N-succinimidyl-3-(2-pyridyldithiol ) propionate 
(SPDP), iminothiolane (IT), Afunctional derivatives of imidoesters (such as dimethyl 
adipimidate HCL), active esters (such as disuccinimidyl suberate), aldehydes (such as 
glutareldehyde), bis-azido compounds (such as bis (p-azidobenzoyl) hexanediamine), bis- 

1 0 diazonium derivatives (such as bis-(p-diazoniumbenzoyl )-ethylcncdiamine), diisocyanates (such 
as tolyene 2,6-diisoeyanatc), and bis-active fluorine compounds (such as 1 ,5-dilluoro-2,4- 
dinitrobenzene). For example, a ricin immunotoxin can be prepared as described in Vitetta et 
al.. Science . 238 : 1098 (1987). Carbon- 1 4-labeled 1 -isothiocyanatobenzyl-3-methyldiethylene 
triaminepentaacetic acid (MX-DTPA) is an exemplary chelating agent for conjugation of 

15 radionucleotide to the antibody. See W094/1 1 026. 

In another embodiment, the antibody may be conjugated to a "receptor" (such 
slreptavidin) for utilization in tumor pretargeting wherein the antibody-receptor conjugate is 
administered to the patient, followed by removal of unbound conjugate from the circulation using 
a clearing agent and then administration of a "ligand" (e.g., avidin) that is conjugated to a 

20 cytotoxic agent (e.g., a radionucleotide). 



The antibodies disclosed herein may also be formulated as immunoliposomes. 
Liposomes containing the antibody are prepared by methods known in the art, such as described 
25 in Epstein et a!., Proc. Natl. Acad. Sci. USA , 82: 3688 (1985); Hwang et aL, Proc. Natl Acad. 
Sei. USA, 77: 4030 (1980); and U.S. Pat. Nos. 4,485,045 and 4,544,545. Liposomes with 
enhanced circulation time are disclosed in U.S. Patent No. 5,013,556. 

Particularly useful liposomes can be generated by the reverse-phase evaporation method 
with a lipid composition comprising phosphatidylcholine, cholesterol, and PEG-derivatized 
30 phosphatidylcthanolamine (PEG-PE). Liposomes are extruded through filters of defined pore 
size to yield liposomes with the desired diameter. Fab* fragments of the antibody of the present 



8. 



I m munoliposomes 
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invention can be conjugated to the liposomes as described in Martin ct a\ .. J. Biol. Chcm. , 257 : 
286-288 (1982) via a disulfide-intcrchange reaction. A cheniotherapeutic agent (such as 
Doxorubicin) is optionally contained within the liposome. SeeGabizon eta/., J. National Cancer 
Inst.. 81(19): 1484(1989). 

5 9. Pharmaceutical Compositions of Antibodies 

Antibodies specifically binding a SRT polypeptide identified herein, as well as other 
molecules identified by the screening assays disclosed hereinbefore, can be administered for the 
treatment of various disorders in the form of pharmaceutical compositions. 

If the SRT polypeptide is intracellular and whole antibodies are used as inhibitors, 
10 internalizing antibodies are preferred. However, lipofections or liposomes can also be used to 
deliver the antibody, or an antibody fragment, into cells. Where antibody fragments are used, the 
smallest inhibitory fragment that specifically binds to the binding domain of the target protein 
is preferred. For example, based upon the variable-region sequences of an antibody, peptide 
molecules can be designed that retain the ability to bind the target protein sequence. Such 
1 5 peptides can be synthesized chemically and/or produced by recombinant DN A technology. Sec, 
e.g.. Marasco et a/.. Proc. Natl. Acad. Sci. USA . 90: 7889-7893 (1993). The formulation 
herein may also contain more than one active compound as necessary for the particular indication 
being treated, preferably those with complementary activities that do not adversely affect each 
other. Alternatively, or in addition, the composition may comprise an agent that enhances its 
20 function, such as. for example, a cytotoxic agent, cytokine, cheniotherapeutic agent, or growth- 
inhibitory agent. Such molecules are suitably present in combination in amounts that are 
effective for the purpose intended. 

The active ingredients may also be entrapped in microcapsules prepared, for example, by 
coacervation techniques or by interfaciai polymerization, for example, hydroxymethylcellulose 
25 orgelatin-microcapsules and poly-(methylmethacylatc) microcapsules, respectively, in colloidal 
drug delivery systems (for example, liposomes, albumin microspheres, microemulsions, nano- 
particles, and nanocapsules) or in macroemulsions. Such techniques are disclosed in 
Remingtons Ph arm ace u t i c a 1 Scjcn ee s , supra. 

The formulations to be used for in vivo administration must be sterile. This is readily 
30 accomplished by filtration through sterile filtration membranes. 

Sustained-release preparations may be prepared. Suitable examples of sustained-release 
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preparations include semipermeable matrices of solid hydrophobic polymers containing the 
antibody, which matrices are in the form of shaped articles, e.g., films, or microcapsules. 
Examples of sustained-release matrices include polyesters, hydrogels (for example. poly(2- 
hydroxyethyl-methacrylate), or poly(vinylalcohol)), poiylactidcs (U.S. Pat. No. 3,773,919), 
copolymers of L-glutamic acid and y ethyl-L-glutamate, non-degradable ethylene-vinyl acetate, 
5 degradable lactic acid-glycolic acid copolymers such as the LUPRON DEPOT 1M (injectable 
microspheres composed of lactic acid-glycolic acid copolymer and leuprolide acetate), and poly- 
D-(-)-3-hydroxybutyric acid. While polymers such as ethylene-vinyl acetate and lactic acid- 
glycolic acid enable release of molecules for over 100 days, certain hydrogels release proteins 
for shorter time periods. When encapsulated antibodies remain in the body for a long time, they 

10 may denature or aggregate as a result of exposure to moisture at 37 "C, resulting in a loss of 
biological activity and possible changes in immunogenicity. Rational strategies can be devised 
for stabilization depending on the mechanism involved. For example, if the aggregation 
mechanism is discovered to be intermolecular S-S bond formation through thio-disulfide 
interchange, stabilization may be achieved by modifying sulfhydryl residues, lyophilizing from 

15 acidic solutions, controlling moisture content, using appropriate additives, and developing 
specific polymer matrix compositions. 

G. Uses for anti-SRT Antibodies 

The anti-SRT antibodies of the invention have various utilities. For example, anti-SRT 
20 antibodies may be used in diagnostic assays for SRT, e.g., detecting its expression in specific 
cells, tissues, or serum. Various diagnostic assay techniques known in the art may be used, such 
as competitive binding assays, direct or indirect sandwich assays and immunoprecipitation assays 
conducted in either heterogeneous or homogeneous phases [Zola, Monoclonal Antibodies: A 
Manual of Techniques . CRC Press, Inc. (1987) pp. 147-158]. The antibodies used in the 
25 diagnostic assays can be labeled with a detectable moiety. The detectable moiety should be 
capable of producing, either directly or indirectly, a detectable signal. For example, the 
detectable moiety may be a radioisotope, such as H, l4 C, ^ 2 P, XS S, or l25 I, a fluorescent or 
chemiluminescent compound, such as fluorescein isothiocyanate, rhodamine, or luciferin, or an 
enzyme, such as alkaline phosphatase, beta-gal actosidase or horseradish peroxidase. Any method 
30 known in the art for conjugating the antibody to the detectable moiety may be employed, 
including those methods described by Hunter et al.. Nature . 144 :945 (1962); David et ah. 
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Biochemistry , J_3: 1014 (1974); Pain et al., J. Immunol. Meth. , 40:219 ( 1981 ); and Nygren, J. 
Histochcm. and Cytochcm. , 30:407 (1982). 

Anti-SRT antibodies also are useful for the affinity purification of SRT from recombinant 
cell culture or natural sources. In this process, the antibodies against SRT are immobilized on 
a suitable support, such a Scphadex resin or filter paper, using methods well known in the art. 
5 The immobilized antibody then is contacted with a sample containing the SRT to be purified, and 
thereafter the support is washed with a suitable solvent that will remove substantially all the 
material in the sample except the SRT, which is bound to the immobilized antibody. Finally, the 
support is washed with another suitable solvent that will release the SRT from the antibody. 

The following examples are offered for illustrative purposes only, and are not intended 
10 to limit the scope of the present invention in any way. 

All patent and literature references cited in the present specification arc hereby 
incorporated by reference in their entirety. 

15 

EXAMPLES 

Commercially available reagents referred to in the examples were used according to 
manufacturers instructions unless otherwise indicated. The source of those cells identified in 
the following examples, and throughout the specification, by ATCC accession numbers is the 
20 American Type Culture Collection, Manassas, VA. 

EXAMPLE 1 
Isolation of SRT cDNAs 
I . Preparation of oligo dT primed cDNA library 
25 mRNA was isolated from human tissue using reagents and protocols from Invitrogen, San 

Diego, CA (Fast Track 2). This RN A was used to generate an oligo dT primed cDN A library in 
the vector pRK5D using reagents and protocols from Life Technologies, Gaithersburg, MD 
(Super Script Plasmid System). In this procedure, the double stranded cDNA was sized to 
greater than 1 000 bp and the Sal I/Not I linkered cDNA was cloned into XhoI/NotI cleaved vector. 
30 pRK5D is a cloning vector that has an sp6 transcription initiation site followed by an Sfil 
restriction enzyme site preceding the XhoI/NotI cDNA cloning sites. 
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2. Preparation of random primed cDNA library 

A secondary cDNA library was generated in order to preferentially represent the 5" ends 
of the primary cDNA clones. Sp6 RNA was generated from the primary library (described 
above), and this RNA was used to generate a random primed cDNA library in the vector pSST- 
AMY.O using reagents and protocols from Life Technologies (Super Script Plasmid System. 
5 referenced above). In this procedure the double stranded cDNA was sized to 500-1000 bp, 
linkered with blunt to Noll adaptors, cleaved with Sfil, and cloned into Sfil/NotI cleaved vector. 
pSST-AM Y.O is a cloning vector that has a yeast alcohol dehydrogenase promoter preceding the 
cDNA cloning sites and the mouse amylase sequence (the mature sequence without the secretion 
signal) followed by the yeast alcohol dehydrogenase terminator, after the cloning sites. Thus, 
1 0 cDN As cloned into this vector that are fused in frame w ith the amylase sequence will lead to the 
secretion of amylase from appropriately transfected yeast colonies. 

3. Transformation and Detection 

DNA from the library described in paragraph 2 above was chilled on ice to which was 

15 added electrocompetent DH10B bacteria (Life Technologies, 20 ml). The bacteria and vector 
mixture was then electroporated as recommended by the manufacturer. Subsequently, SOC 
media (Life Technologies. 1 ml) was added and the mixture was incubated at 37 C C for 30 
minutes. The transformanls were then plated onto 20 standard 150 mm LB plates containing 
ampicillin and incubated for 1 6 hours (37 °C). Positive colonies were scraped off the plates and 

20 the DNA was isolated from the bacterial pellet using standard protocols, e.g. CsCl-gradient. The 
purified DNA was then carried on to the yeast protocols below. 

The yeast methods were divided into three categories: ( 1 ) Transformation of yeast with 
the plasmid/cDNA combined vector; (2) Detection and isolation of yeast clones secreting 
amylase: and (3) PCR amplification of the insert directly from the yeast colony and purification 

25 of the DNA for sequencing and further analysis. 

The yeast strain used was HD56-5A (ATCC-90785). This strain has the following 
genotype: MAT alpha, ura3-52. leu2-3, Ieu2-112. his3-ll, his3-15, MAL + , SUC + , GAL + . 
Preferably, yeast mutants can be employed that have deficient post-translational pathways. Such 
mutants may have translocation deficient alleles in sec7\, .vcc72, .vc<'62, with truncated sec7\ 

30 being most preferred. Alternatively, antagonists (including antisense nucleotides and/or ligands) 
which interfere with the normal operation of these genes, other proteins implicated in this post 
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translation pathway (e.g., SEC61p, SEC72p, SEC62p. SEC63p, TDJlp or SSAlp-4p) or the 
complex formation of these proteins may also be preferably employed in combination with the 
amylase-expressing yeast. 

Transformation was performed based on the protocol outlined by Gietz et al., Nuel. Acid. 
Res. , 20: 1425 (1992). Transformed cells were then inoculated from agar into YEPD complex 
5 media broth ( 1 00 ml) and grown overnight at 30 C. The YEPD broth was prepared as described 
in Kaiser et al.. Methods in Yeast Genetics , Cold Spring Harbor Press, Cold Spring Harbor, NY, 
p. 207 (1994). The overnight culture was then diluted to about 2 x 10 (1 cells/ml (approx. 
OD WK) =(). 1 ) into fresh YEPD broth (500 ml) and rcgrown to 1 x 1 0 7 cells/ml (approx. OD fl00 =0.4- 
0.5). 

1 0 The cells were then harvested and prepared for transformation by transfer into GS3 rotor 

bottles in a Sorval GS3 rotor at 5,000 rpm for 5 minutes, the supernatant discarded, and then 
resuspended into sterile water, and centrifuged again in 50 ml falcon tubes at 3,500 rpm in a 
Beckman GS-6KR centrifuge. The supernatant was discarded and the cells were subsequently 
washed with LiAc/TE (10 ml, 10 mM Tris-HCI, 1 mM EDTA pH 7.5, 100 mM LLOOCCH,), 

15 and resuspended into LiAc/TE (2.5 ml). 

Transformation took place by mixing the prepared cells ( 100 ul) with freshly denatured 
single stranded salmon testes DNA (Lofstrand Labs, Gaithersburg, MD) and transforming DNA 
( l u.g, vol. < 10 ul) in microfuge tubes. The mixture was mixed briefly by vortexing, then 40 c /r 
PEG/TE (600 ul, 40% polyethylene glycol-4000, 10 mM Tris-HCI, 1 mM EDTA, 100 mM 

20 Li 2 OOCCH 3 , pH 7.5) was added. This mixture was gently mixed and incubated at 30°C while 
agitating for 30 minutes. The cells were then heat shocked al 42 C for 15 minutes, and the 
reaction vessel centrifuged in a microfuge at 12,000 rpm for 5-10 seconds, decanted and 
resuspended into TE (500 ul, 10 mM Tris-HCI. I mM EDTA pH 7.5) followed by 
recent rifugat ion. The cells were then diluted into TE (1 ml) and aliquots (200 ul) were spread 

25 onto the selective media previously prepared in 150 mm growth plates (VWR). 

Alternatively, instead of multiple small reactions, the transformation was performed using 
a single, large scale reaction, wherein reagent amounts were scaled up accordingly. 

The selective media used was a synthetic complete dextrose agar lacking uracil (SCD- 
Ura) prepared as described in Kaiser et al., Methods in Yeast Genetics . Cold Spring Harbor 

30 Press, Cold Spring Harbor. NY, p. 208-2 10(1 994). Transformants were grown at 30' C for 2-3 
days. 
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The detection of colonies secreting amylase was performed by including red starch in the 
selective growth media. Starch was coupled to the red dye (Reactive Red- 1 20. Sigma) as per the 
procedure described by Biely el ah. Anal. Bioehem. , I 72 : l 76- 1 79 ( 1988). The coupled starch 
was incorporated into the SCD-Ura agar plates at a final concentration of 0A57r (w/v), and was 
buffered with potassium phosphate to a pH of 7.0 (50-100 mM final concentration). 

The positive colonies were picked and streaked across fresh selective media (onto 150 
mm plates) in order to obtain well isolated and identifiable single colonies. Well isolated single 
colonies positive for amylase secretion were detected by direct incorporation of red starch into 
buffered SCD-Ura agar. Positive colonies were determined by their ability to break down starch 
resulting in a clear halo around the positive colony visualized directly. 

4. Isolation of DNA by PCR Amplification 

When a positive colony was isolated, a portion of it was picked by a toothpick and diluted 
into sterile water (30 ud) in a 96 well plate. At this time, the positive colonies were either fro/en 
and stored for subsequent analysis or immediately amplified. An aliquot of cells (5 ul) was used 
as a template for the PCR reaction in a 25 ul volume containing: 0.5 ul Klentaq (Clontcch, Palo 
Alto, CA); 4.0 ul 10 mM dNTP's (Perkin Elmer-Cetus); 2.5 ul Klentaq buffer (Clontech); 0.25 
ul forward oligo 1 ; 0.25 ul reverse oligo 2: 12.5 ul distilled water. The sequence of the forward 
oligonucleotide 1 was: 

5-TGTAAAACGACGGCCAGT TAAATAGACCTGCAATTATTAATCT -3' (SEQ ID 
NO:563) 

The sequence of reverse oligonucleotide 2 was: 

5 -CAGGAAACAGCTATGACC ACCTGCACACCTGCAAATCCATT -3' (SEQ ID 
NO:564) 

PCR was then performed as follows: 



a. 



3 cycles of: 



3 cycles of: 



Denature 

Denature 

Anneal 

Extend 

Denature 

Anneal 

Extend 



92 C, 5 minutes 

92 °C, 30 seconds 

59 °C, 30 seconds 
72 °C, 60 seconds 

92 C, 30 seconds 

57 °C, 30 seconds 
72°C, 60 seconds 
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25 cycles of: Denature 
Anneal 
Extend 



92 C, 30 seconds 

55 C, 30 seconds 
72"X\ 60 seconds 



e. 



Hold 



4 C 



5 



The underlined regions of the oligonucleotides disclosed above annealed to the ADH 
promoter region and the amylase region, respectively, and amplified a 307 bp region from vector 
pSST-AMY.O when no insert was present. Typically, the first 18 nucleotides of the 5' end of 
these oligonucleotides contained annealing sites for the sequencing primers. Thus, the total 
1 0 product of the PCR reaction from an empty vector was 343 bp. However, signal sequence-fused 
cDNA resulted in considerably longer nucleotide sequences. 

Following the PCR. an aliquot of the reaction (5 ul) was examined by agarose gel 
electrophoresis in a \ c k agarose gel using a Tris-Borate-EDTA (TBE) buffering system as 
described by Sambrook et al., supra . Clones resulting in a single strong PCR product larger than 
15 400 bp were further analyzed by DNA sequencing after purification with a 96 Qiaquick PCR 
clean-up column (Qiagen Inc., Chatsworth, CA). 

cDNA molecules isolated from this amylase screen are shown in Figures l -562 (SEQ ID 
NOS: 1-562, respectively), wherein the nucleotides **NT and "X" represent any nucleotide. The 
cDNA libraries from which these cDNA molecules were obtained arc as follows: 
20 (a) Human liver tissue 

Figures 119, 124 and 130. 

(b) Human placenta tissue 
Figures 20-73. 

(c) Human retina tissue 



(d) Human salivary gland tissue 
Figures 76-78. 

(e) Human umbilical vein endothelial cells 

Figures 79-80, 97, 1 10, 245-252, 254-260, 263-265, 4 1 3-42 1 , 433-437, 444-449, 454- 
30 456, 462-467, 477-478, 480-485, 492-493, 5 1 5 and 548. 

(f) Human thyroid tissue 

Figures 82-84, 90-91, 96, 109, 141-143 and 268. 

(g) Human small intestine tissue 



25 



Figures 74-75, 81. 107-108. 139-140 and 340-341. 
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Figures 85-86, 144-161 and 267. 




(h) 


Human colon carcinoma tissue 






Figure 87. 




(i) 


Human lung endothelial cells 






Figures 88 and 93-95. 


5 


(j) 


Human hypothalamus tissue 






Figure 89. 




(k) 


Human breast carcinoma tissue 






Figures 92, 111-115, 206-213, 228-232, 269-270, 450-453. 534-547, 556 and 559. 




(1) 


Human aortic endothelial cells 


10 




Figures 98- 1 02, 1 25- 1 29, 1 36- 1 38, 216-21 7, 253, 26 1 -262, 300-30 1 , 327-330, 365-367 




and 385-387. 




(m) 


Human uterus tissue 






Figures 103-106, 170-173, 176-183, 233-235, 238, 242-244, 266. 311-312 and 557. 




(n) 


Human lung carcinoma tissue 


15 




Figures 1 06- 1 08, 20 1 -205, 22 1 -227, 27 1 -274, 334-339. 342-348, 350-35 1 , 360-364, 372, 




388-408. 41 1, 431-432, 479, 558 and 560-561. 




(o) 


Human mammary epithelial cells 






Figures 1 19-121, 214 and 316-320. 




(P) 


Human chronic myeloszenous leukemia tissue 


20 




Figures 122-123 and 131-135. 




(q) 


Human spinal cord tissue 






Figures 162. 167-169, 198-200, 236 and 315. 




(r) 


Human fetal brain tissue 






Figures 163-166, 1 74-1 75, 332-333, 422-430 and 494-502. 


25 


(s) 


Human fetal kidney tissue 






Figures 1 84- 1 97, 409-4 1 0 and 4 1 2. 




(t) 


Human prostate tissue 






Figures 215, 237, 239-241 and 349. 




(u) 


Human mammary gland tissue 


30 




Figures 218-220, 275-276 and 331. 




(v) 


Human adenocarcinoma tissue 
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Figures 277-299 and 302-310. 
Human fetal small intestine tissue 
Figures 313-314. 
Human fetal lung tissue 
Figures 32 1 -326. 
Human testis tissue 

Figures 352-359, 368-37 1 , 377-384, 438-443, 457-46 1 , 486-49 1,51 3-5 1 4, 5 1 6-527 and 
Human MCF-7 eells 

Figures 373-376. 468-476, 503-512, 528-533 and 549-555. 

EXAMPLE 2 
Identification of full-length cDNA molecules 
Oligonucleotide probes may be generated from the sequence of any of the SRT 
polynucleotide sequences disclosed herein, including those shown in Figures 1 to 562 and used 
1 5 to screen human cDNA libraries prepared as described in paragraph I of Example 1 above. The 
cloning vector may be pRK5B (pRK5B is a precursor of pRK5D that does not contain the Sfil 
site; see, Holmes et af. Science 253: 1 278- 1 280 ( 1 99 1 )), and the cDN A size cut may be less than 
2800 bp. The oligonucleotides probes may be synthesized: 1 ) to identify by PCR a cDNA library 
that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full- 
20 length coding sequence for SRT. Forward and reverse PCR primers generally range from 20 to 
30 nucleotides and arc often designed to give a PCR product of about 100-1000 bp in length. 
The probe sequences are typically 40-55 bp in length. In order to screen several libraries for a 
full-length clone, DN A from the libraries may be screened by PCR amplification, as per Ausubel 
et al., Current Protocols in Molecular Biology , supra, with the PCR primer pair. A positive 
25 library may then be used to isolate clones encoding the gene of interest using the probe 
oligonucleotide and one of the primer pairs. 



30 EXAMPLE 3 

Use of SRT polynucleotides as hybridization probes 



(w) 
(x) 

5 (y) 

562 
(/) 
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The following method describes use of a nucleotide sequence encoding SRT as a 
hybridization probe. 

DNA comprising the coding sequence of lull-length or mature SRT is employed as a 
probe to screen lor homologous DNAs (such as those encoding naturally-occurring variants of 
SRT) in human tissue cDNA libraries or human tissue genomic libraries. 
5 Hybridization and washing of filters containing cither library DNAs is performed under 

the following high stringency conditions. Hybridization of radiolabeled SRT-derived probe to 
the filters is performed in a solution of 50 ( 7c formamide, 5x SSC, OA c /c SDS, 0.1 c h sodium 
pyrophosphate, 50 mM sodium phosphate, pH 6.8, 2x Dcnhardt s solution, and HY.c dextran 
sulfate at 4TC for 20 hours. Washing of the filters is performed in an aqueous solution of 0.1 x 
1 0 SSC and 0. 1 % SDS at 42°C. 

DNAs having a desired sequence identity with the DNA encoding full-length native 
sequence SRT can then be identified using standard techniques known in the art. 



This example illustrates preparation of an unglycosylated form ot SRT by recombinant 
expression in /:. coli. 

The DNA sequence encoding SRT is initially amplified using selected PCR primers. The 
primers should contain restriction enzyme sites which correspond to the restriction enzyme sites 

20 on the selected expression vector. A variety of expression vectors may be employed. An 
example of a suitable vector is pBR322 (derived from coli; see Bolivar et al.. Gene , 2:95 
(1977)) which contains genes for ampicillin and tetracycline resistance. The vector is digested 
with restriction enzyme and dcphosphorylated. The PCR amplified sequences are then ligated 
into the vector. The vector will preferably include sequences which encode for an antibiotic 

25 resistance gene, a trp promoter, a polyhis leader (including the first six STII codons, polyhis 
sequence, and enterokinase cleavage site), the SRT coding region, lambda transcriptional 
terminator, and an argU gene. 

The ligation mixture is then used to transform a selected E. coli strain using the methods 
described in Sambrook ct al.. supra . Transformants are identified by their ability to grow on LB 

30 plates and antibiotic resistant colonies are then selected. Plasmid DNA can be isolated and 
confirmed by restriction analysis and DNA sequencing. 



EXAMPLE 4 



15 



Expression of SRT in E. coli 
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Selected clones can be grown overnight in liquid culture medium such as LB broth 
supplemented with antibiotics. The overnight culture may subsequently be used to inoculate a 
larger scale culture. The cells are then grown to a desired optical density, during which the 
expression promoter is turned on. 

Alter culturing the cells for several more hours, the cells can be harvested by 
5 centrifugation. The cell pellet obtained by the ccnlrifugation can be solubilized using various 
agents known in the art, and the solubilized SRT protein can then be purified using a metal 
chelating column under conditions that allow tight binding of the protein. 

SRT may be expressed in E. coli in a poly-His tagged form, using the following 
procedure. The DNA encoding SRT is initially amplified using selected PCR primers. The 

1 0 primers will contain restriction enzyme sites which correspond to the restriction enzyme sites on 
the selected expression vector, and other useful sequences providing for efficient and reliable 
translation initiation, rapid purification on a metal chelation column, and proteolytic removal 
with enterokinase. The PCR-amplified, poly-His tagged sequences are then ligatcd into an 
expression vector, which is used to transform an E. coli host based on strain 52 (W3110 

1 5 fuhA(tonA) Ion galE rpoHts(htpRts) clpP(ladq). Transformants are first grown in LB containing 
50 mg/ml carbenicillin at 30°C with shaking until an O.D.600 of 3-5 is reached. Cultures are 
then diluted 50-100 fold into CRAP media (prepared by mixing 3.57 g (NH 4 ) 2 S0 4 , 0.7 1 g sodium 
citrate»2H20, 1.07 g KC1, 5.36 g Difco yeast extract, 5.36 g Sheffield hycase SF in 500 mL 
water, as well as 110 mM MPOS, pH 7.3, 0.55% (w/v) glucose and 7 mM MgS0 4 ) and grown 

20 for approximately 20-30 hours at 30°C with shaking. Samples are removed to verify expression 
by SDS-PAGE analysis, and the bulk culture is centrifuged to pellet the cells. Cell pellets are 
frozen until purification and refolding. 

E. coli paste from 0.5 to 1 L fermentations (6-10 g pellets) is resuspended in 10 volumes 
(w/v) in 7 M guanidine, 20 mM Tris, pH 8 buffer. Solid sodium sulfite and sodium tetrathionate 

25 is added to make final concentrations of 0.1 M and 0.02 M, respectively, and the solution is 
stirred overnight at 4°C. This step results in a denatured protein with all cysteine residues 
blocked by sulfitolization. The solution is centrifuged at 40,0(30 rpm in a Beckman Ultracentifuge 
for 30 min. The supernatant is diluted w ith 3-5 volumes of metal chelate column buffer (6 M 
guanidine, 20 mM Tris, pH 7.4) and filtered through 0.22 micron filters to clarify. The clarified 

30 extract is loaded onto a 5 ml Qiagen Ni-NTA metal chelate column equilibrated in the metal 
chelate column buffer. The column is washed with additional buffer containing 50 mM 
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imidazole (Calbiochem, Utrol grade), pH 7.4. The protein is eluted with buffer containing 250 
mM imidazole. Fractions containing the desired protein are pooled and stored at 4 C. Protein 
concentration is estimated by its absorbance at 280 nm using the calculated extinction coefficient 
based on its amino acid sequence. 

The proteins are refolded by diluting the sample slowly into freshly prepared refolding 
5 buffer consisting of: 20 mM Tris, pH 8.6, 0.3 M NaCl, 2.5 M urea, 5 mM cysteine, 20 mM 
glycine and 1 mM EDTA. Refolding volumes are chosen so that the final protein concentration 
is between 50 to 100 micrograms/ml. The refolding solution is stirred gently at 4 C for 12-36 
hours. The refolding reaction is quenched by the addition of TFA to a final concentration of 
0A c ,'c (pH of approximately 3). Before further purification of the protein, the solution is filtered 

10 through a 0.22 micron filter and acetonitrile is added to 2- 1 07c final concentration. The refolded 
protein is chromatographed on a Poros Rl/H reversed phase column using a mobile buffer of 
0. 1 c ,h TFA with clution with a gradient of acetonitrile from 1 0 to 80%. Aliquots of fractions with 
A280 absorbance are analyzed on SDS polyacrylamide gels and fractions containing 
homogeneous refolded protein are pooled. Generally, the properly refolded species of most 

15 proteins arc eluted at the lowest concentrations of acetonitrile since those species are the most 
compact with their hydrophobic interiors shielded from interaction with the reversed phase resin. 
Aggregated species are usually eluted at higher acetonitrile concentrations. In addition to 
resolving misfolded forms of proteins from the desired form, the reversed phase step also 
removes endotoxin from the samples. 

20 Fractions containing the desired folded SRT polypeptide are pooled and the acetonitrile 

removed using a gentle stream of nitrogen directed at the solution. Proteins are formulated into 
20 mM Hepes, pH 6.8 with 0.14 M sodium chloride and 4% mannitol by dialysis or by gel 
filtration using G25 Superfine (Pharmacia) resins equilibrated in the formulation buffer and 
sterile filtered. 

25 

EXAMPLE 5 
Expression of SRT in mammalian cells 
This example illustrates preparation of a potentially glycosylated form of SRT by 
recombinant expression in mammalian cells. 
30 The vector, pRK5 (see EP 307,247, published March 15, 1989), is employed as the 

expression vector. Optionally, the SRT DNA is ligated into pRK5 with selected restriction 
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enzymes to allow insertion of the SRT DNA using ligation methods such as described in 
Sambrook et al., supra . The resulting vector is called pRK5-SRT. 

In one embodiment, the selected host cells may be 293 cells. Human 293 cells (ATCC 
CCL 1573) are grown to confluence in tissue culture plates in medium such as DMKM 
supplemented with fetal calf serum and optionally, nutrient components and/or antibiotics. 
5 About 10 ug pRK5-SRT DNA is mixed with about 1 ug DNA encoding the VA RNA gene 
[Thimmappaya et al., Cell, 31:543 (1982)] and dissolved in 500 ul of 1 mM Tris-HCl, 0.1 mM 
EDTA, 0.227 M CaCb. To this mixture is added, dropwise, 500 ul of 50 mM HEPES (pH 7.35), 
280 mM NaCl, 1 .5 mM NaPC> 4 , and a precipitate is allowed to form for 10 minutes at 25°C. The 
precipitate is suspended and added to the 293 cells and allowed to settle for about lour hours at 

10 37°C. The culture medium is aspirated off and 2 ml of 20% glycerol in PBS is added for 30 
seconds. The 293 cells are then washed with serum free medium, fresh medium is added and the 
cells are incubated for about 5 days. 

Approximately 24 hours after the transfections, the culture medium is removed and 
replaced with culture medium (alone) or culture medium containing 200 uCi/ml °S-cysteinc and 

15 200 u.Ci/ml vS S-methionine. After a 12 hour incubation, the conditioned medium is collected, 
concentrated on a spin filter, and loaded onto a \5 r /c SDS gel. The processed gel may be dried 
and exposed to film for a selected period of time to reveal the presence of SRT polypeptide. The 
cultures containing transfected cells may undergo further incubation (in serum free medium > and 
the medium is tested in selected bioassays. 

20 In an alternative technique, SRT may be introduced into 293 cells transiently using the 

dextran sulfate method described by Somparyrac et al., Proc. Natl. Acad. Sci. , 12:7575 (1981). 
293 cells are grown to maximal density in a spinner flask and 700 ug pRK5-SRT DNA is added. 
The cells are first concentrated from the spinner flask by centrifugation and washed with PBS. 
The DNA-dcxtran precipitate is incubated on the cell pellet for four hours. The cells are treated 

25 with 20 ( ?c glycerol for 90 seconds, washed with tissue culture medium, and re-introduced into 
the spinner flask containing tissue culture medium, 5 ug/ml bovine insulin and 0. 1 ug/ml bovine 
transferrin. After about four days, the conditioned media is ccntrifuged and filtered to remove 
cells and debris. The sample containing expressed SRT can then be concentrated and purified 
by any selected method, such as dialysis and/or column chromatography. 

30 

In another embodiment, SRT can be expressed in CHO cells. The pRK5-SRT can be 
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transfectcd into CHO cells using known reagents such as CaP0 4 or DEAE-dcxtran. As described 
above, the cell cultures can be incubated, and the medium replaced with culture medium (alone) 
or medium containing a radiolabel such as \S-methionine. After determining the presence of 
SRT polypeptide, the culture medium may be replaced with serum free medium. Preferably, the 
cultures are incubated for about 6 days, and then the conditioned medium is harvested. The 
5 medium containing the expressed SRT can then be concentrated and purified by any selected 
method. 

Epitope-lagged SRT may also be expressed in host CHO cells. The SRT may be 
subeloned out of the pRK5 vector. The subclone insert can undergo PCR to fuse in frame with 
a selected epitope tag such as a poly-his tag into a Baculovirus expression vector. The poly-his 
1 0 tagged SRT insert can then be subeloned into a SV40 driven vector containing a selection marker 
such as DHFR for selection of stable clones. Finally, the CHO ceils can be transfectcd (as 
described above) with the S V40 driven vector. Labeling may be performed, as described above, 
to verify expression. The culture medium containing the expressed poly-His tagged SRT can 
then be concentrated and purified by any selected method, such as by Nr + -chelatc affinity 
1 5 chromatography. 

SRT may also be expressed in CHO and/or COS cells by a transient expression procedure 
or in CHO cells by another stable expression procedure. 

Stable expression in CHO cells is performed using the following procedure. The proteins 
are expressed as an IgG construct (immunoudhesin), in which the coding sequences for the 
20 soluble forms (e.g. extracellular domains) of the respective proteins are fused to an IgG 1 constant 
region sequence containing the hinge, CH2 and CH2 domains and/or is a poly-His tagged form. 

Following PCR amplification, the respective DNAs are subeloned in a CHO expression 
vector using standard techniques as described in Ausubel et al.. Current Protocols of Molecular 
Biology , Unit 3.16, John Wiley and Sons (1997). CHO expression vectors are constructed to 
25 have compatible restriction sites 5* and 3' of the DNA of interest to allow the convenient 
shuttling of cDNA's. The vector used expression in CHO cells is as described in Lucas et al., 
Nucl. Acids Res. 24:9 ( 1 774- 1 779 (1996), and uses the SV40 early promoter/enhancer to drive 
expression of the cDNA of interest and dihydrofolate reductase (DHFR). DHFR expression 
permits selection for stable maintenance of the plasmid following transfection. 
30 Twelve micrograms of the desired plasmid DNA is introduced into approximately 10 

million CHO cells using commercially available transfection reagents Superfcct 04 (Quiagen), 
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Dosper or Fugene' (Boehringer Mannheim). The eells are grown as described in Lucas et al., 
supra . Approximately 3 x It) 7 cells are frozen in an ampule for further growth and production 
as described below. 

The ampules containing the plusmid DNA are thawed by placement into water bath and 
mixed by vortexing. The contents are pipetted into a centrifuge tube containing 10 mLs of media 
5 and ccntrifugcd at 1000 rpm for 5 minutes. The supernatant is aspirated and the cells are 
resuspended in 10 niL of selective media (0.2 //m filtered PS20 with 5 c /c 0.2 fum diafiltcrcd fetal 
bovine scrum). The cells are then aliquoted into a 100 mL spinner containing 90 mL of selective 
media. After 1-2 days, the cells are transferred into a 250 mL spinner filled with 150 mL 
selective growth medium and incubated at 37°C. After another 2-3 days, 250 mL, 500 mL and 

10 2000 mL spinners are seeded with 3 x 10 s cells/mL. The cell media is exchanged with fresh 
media by centrif ligation and resuspension in production medium. Although any suitable CHO 
media may be employed, a production medium described in U.S. Patent No. 5,122,469, issued 
June 16, 1992 may actually be used. A 3L production spinner is seeded at 1.2 x 10 6 cells/mL. 
On day 0, the cell number pH ie determined. On day 1 , the spinner is sampled and sparging with 

1 5 filtered air is commenced. On day 2, the spinner is sampled, the temperature shifted to 33°C, and 
30 mL of 500 g/L glucose and 0.6 mL of 10% ant i foam (e.g., 359?- polydimethylsiloxane 
emulsion, Dow Corning 365 Medical Grade Emulsion) taken. Throughout the production, the 
pH is adjusted as necessary to keep it at around 7.2. After 10 days, or until the viability dropped 
below 70%, the cell culture is harvested by ccntrifugation and filtering through a 0.22 /um filter. 

20 The filtrate was either stored at 4 ll C or immediately loaded onto columns for purification. 

For the poly-His tagged constructs, the proteins are purified using a Ni-NTA column 
(Qiagen). Before purification, imidazole is added to the conditioned media to a concentration 
of 5 mM. The eonditioncd media is pumped onto a 6 ml Ni-NTA column equilibrated in 20 mM 
Hepes, pH 7.4, buffer containing 0.3 M NaCl and 5 mM imidazole at a flow rate of 4-5 ml/mi n. 

25 at 4 U C. After loading, the column is washed with additional equilibration buffer and the protein 
eluted with equilibration buffer containing 0.25 M imidazole. The highly purified protein is 
subsequently desalted into a storage buffer containing 10 mM Hepes, 0.14 M NaCl and 4% 
mannitol, pH 6.8, with a 25 ml G25 Superfine (Pharmacia) column and stored at -80°C. 

Immunoadhesin (Fc -containing) constructs are purified from the conditioned media as 

30 follows. The conditioned medium is pumped onto a 5 ml Protein A column (Pharmacia) which 
had been equilibrated in 20 mM Na phosphate buffer, pH 6.8. After loading, the column is 
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washed extensively with equilibration buffer before elution with 100 mM citric acid, pU 3.5. 
The eluted protein is immediately neutralized by collecting 1 ml fractions into tubes containing 
275 ju\. of 1 M Tris buffer, pH 9. The highly purified protein is subsequently desalted into 
storage buffer as described above for the polv-His tagged proteins. The homogeneity is assessed 
by SDS polyacrylamide gels and by N-terminal amino acid sequencing by Edman degradation. 

5 

EXAMPLE 6 
Expression of SRT in yeast 
The following method describes recombinant expression of SRT in yeast. 
First, yeast expression vectors are constructed for intracellular production or secretion of 
1 0 SRT from the ADH2/GAPDH promoter. DNA encoding SRT and the promoter is inserted into 
suitable restriction enzyme sites in the selected plasmid to direct intracellular expression of SRT. 
For secretion, DNA encoding SRT can be cloned into the selected plasmid, together with DNA 
encoding the ADH2/GAPDH promoter, a native SRT signal peptide or other mammalian signal 
peptide, or, for example, a yeast alpha-factor or invertase secretory signal/leader sequence, and 
1 5 linker sequences (if needed) for expression of SRT. 

Yeast cells, such as yeast strain ABl 10, can then be transformed with the expression 
plasmids described above and cultured in selected fermentation media. The transformed yeast 
supcrnatants can be analyzed by precipitation with 10% trichloroacetic acid and separation by 
SDS-PAGE, followed by staining of the gels with Coomassie Blue stain. 

20 

Recombinant SRT can subsequently be isolated and purified by removing the yeast cells 
from the fermentation medium by ccntrifugation and then concentrating the medium using 
selected cartridge filters. The concentrate containing SRT may further be purified using selected 
column chromatography resins. 

25 

EXAMPLE 7 

Expression of SRT in baculovirus-infected insect cells 
The following method describes recombinant expression of SRT in Baculovirus-infected 
insect cells. 

30 The sequence coding for SRT is fused upstream of an epitope tag contained within a 

baculovirus expression vector. Such epitope tags include poly-his tags and immunoglobulin tags 
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(like Fc regions of IgG). A variety of plusmids may be employed, including plasmids derived 
from commerciully available plasmids such as pVL1393 (Novagen). Briefly, the sequence 
encoding SRT or the desired portion of the coding sequence of SRT such as the sequence 
encoding the extracellular domain of a transmembrane protein or the sequence encoding the 
mature protein if the protein is extracellular is amplified by PGR w ith primers complementary 
5 to the 5' and 3' regions. The 5' primer may incorporate Hanking (seleeted) restriction enzyme 
sites. The product is then digested with those seleeted restriction enzymes and subcloned into 
the expression vector. 

Recombinant baculovirus is generated by co-trans feet ing the above plasmid and 
BaculoGold™ virus DNA (Pharmingen) into Spodopterafrugipenla ("Sf9") cells (ATCC CRL 

10 171 1) using lipofeetin (commercially available from GIBCO-BRL). After 4-5 days of 
incubation at 28°C. the released viruses are harvested and used for further amplifications. Viral 
infection and protein expression are performed as described by OKcilley et al., Baculovirus 
expression vectors: A Laboratory Manual , Oxford: Oxford University Press (1994). 

Expressed poly-his tagged SRT can then be purified, for example, by Nr + -ehelate affinity 

1 5 chromatography as follows. Extracts arc prepared from recombinant virus-infected Sf9 cells as 
described by Rupert et al. Nature . 362:175-179 (1993). Briefly, Sf9 cells are washed, 
resuspended in sonication buffer (25 mL Hepes, pH 7.9; 12.5 mM MgCL; 0.1 mM EDTA: 10% 
glycerol; 0. 1 % NP-40; 0.4 M KC1), and sonicated twice for 20 seconds on ice. The sonicates are 
cleared by centrifugation, and the supernatant is diluted 50-fold in loading buffer (50 mM 

20 phosphate. 300 mM NaCl, 10% glycerol, pH 7.8) and filtered through a 0.45 /urn filter. A Ni 2+ - 
NTA agarose column (commercially available from Qiagen) is prepared with a bed volume of 
5 mL, washed with 25 mL of water and equilibrated with 25 mL of loading buffer. The filtered 
cell extract is loaded onto the column at 0.5 mL per minute. The column is washed to baseline 
A 280 with loading buffer, at which point fraction collection is started. Next, the column is washed 

25 with a secondary wash buffer (50 mM phosphate; 300 mM NaCl. 10% glycerol, pH 6.0), which 
elutes nonspecifically bound protein. After reaching A 28() baseline again, the column is developed 
with a 0 to 500 mM Imidazole gradient in the secondary wash buffer. One mL fractions are 
collected and analyzed by SDS-PAGE and silver staining or Western blot with Ni 2+ -NTA- 
conjugated to alkaline phosphatase (Qiagen). Fractions containing the eluted His l0 -tagged SRT 

30 are pooled and dialyzed against loading buffer. 

Alternatively, purification of the IgG tagged (or Fc tagged) SRT can be performed using 
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known chromatography techniques, including for instance. Protein A or protein G column 
chromatography. 



EXAMPLE 8 
Preparation of antibodies that bind SRT 
5 This example illustrates preparation of monoclonal antibodies which can specifically bind 

SRT. 

Techniques for producing the monoclonal antibodies are known in the art and are 
described, for instance, in Coding, supra . Immunogens that may be employed include purified 
SRT, fusion proteins containing SRT, and cells expressing recombinant SRT on the cell surface. 
1 0 Seleetion of the immunogen can be made by the skilled artisan without undue experimentation. 

Mice, such as Balb/c, are immunized with the SRT immunogen emulsified in complete 
Freunds adjuvant and injected subculaneously or intraperitoneal ly in an amount from 1 - 1 00 
micrograms. Alternatively, the immunogen is emulsified in MPL-TDM adjuvant (Ribi 
Immunochemical Research, Hamilton, MT) and injected into the animals hind foot pads. The 
1 5 immunized mice are then boosted 10 to 1 2 days later with additional immunogen emulsified in 
the selected adjuvant. Thereafter, for several weeks, the mice may also be boosted with 
additional immunization injections. Serum samples may be periodically obtained from the mice 
by retro-orbital bleeding for testing in ELISA assays to deteet anti-SRT antibodies. 

After a suitable antibody titer has been detected, the animals "positive" for antibodies can 
20 be injected with a final intravenous injection of SRT. Three to four days later, the mice are 
sacrificed and the spleen cells are harvested. The spleen cells are then fused (using 35% 
polyethylene glycol) to a selected murine myeloma cell line such as P3X63AgU. 1 , available from 
ATCC, No. CRL 1597. The fusions generate hybridoma cells which can then be plated in 96 
well tissue culture plates containing HAT (hypoxanthine, aminopterin, and thymidine) medium 
25 to inhibit proliferation of non-fused cells, myeloma hybrids, and spleen cell hybrids. 

The hybridoma cells will be screened in an ELISA for reactivity against SRT. 
Determination of "positive" hybridoma cells secreting the desired monoclonal antibodies against 
SRT is within the skill in the art. 

The positive hybridoma cells can be injected intraperitoneally into syngeneic Balb/c mice 
30 to produce ascites containing the anti-SRT monoclonal antibodies. Alternatively, the hybridoma 
cells can be grown in tissue culture flasks or roller bottles. Purification of the monoclonal 
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antibodies produced in the ascites can be accomplished using ammonium sulfate precipitation, 
followed by gel exclusion chromatography. Alternatively, affinity chromatography based upon 
binding of antibody to protein A or protein G can be employed. 

EXAMPLE 9 

5 Purification of SRT polypeptides using specific antibodies 

Native or recombinant SRT polypeptides may be purified by a variety of standard 
techniques in the art of protein purification. For example. pro-SRT polypeptide, mature SRT 
polypeptide, or pre-SRT polypeptide is purified by immunoaffinity chromatography using 
antibodies specific for the SRT polypeptide of interest. In general, an immunoaffinity column 

10 is constructed by covalently coupling the anti-SRT polypeptide antibody to an activated 
c h ro m atog rap h i c resin. 

Polyclonal immunoglobulins are prepared from immune sera either by precipitation w it h 
ammonium sulfate or by purification on immobilized Protein A (Pharmacia LKB Biotechnology, 
Piscataway, N.J.). Likewise, monoclonal antibodies are prepared from mouse ascites fluid by 

1 5 ammonium sulfate precipitation or chromatography on immobilized Protein A. Partially purified 
immunoglobulin is covalently attached to a chromatographic resin such as CnBr-activated 
SEPH AROSE™ (Pharmacia LKB Biotechnology). The antibody is coupled to the resin, the resin 
is blocked, and the derivative resin is washed according to the manufacturers instructions. 

Such an immunoaffinity column is utilized in the purification of SRT polypeptide by 

20 preparing a fraction from cells containing SRT polypeptide in a soluble form. This preparation 
is derived by solubilization of the whole cell or of a subcellular fraction obtained via differentia! 
centrifugation by the addition of detergent or by other methods well known in the art. 
Alternatively, soluble SRT polypeptide containing a signal sequence may be secreted in useful 
quantity into the medium in which the cells are grown. 

25 A soluble SRT polypeptidc-containing preparation is passed over the immunoaffinity 

column, and the column is washed under conditions that allow the preferential absorbance of 
SRT polypeptide high ionic strength buffers in the presence of detergent). Then, the column 
is eluted under conditions that disrupt antibody/SRT polypeptide binding (e.g., a low pH buffer 
such as approximately pH 2-3, or a high concentration of a chaotrope such as urea or thiocyanate 

30 ion), and SRT polypeptide is collected. 
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EXAMPLE 10 
Drug screening 

This invention is particularly useful tor screening compounds by using SRT polypeptides 
or binding fragment thereof in any of a variety of drug screening techniques. The SRT 
polypeptide or fragment employed in such a test may either be free in solution, affixed to a solid 
5 support, borne on a cell surface, or located intracellularly. One method of drug screening utilizes 
eukaryotic or prokaryotic host cells which are stably transformed with recombinant nucleic acids 
expressing the SRT polypeptide or fragment. Drugs are screened against such transformed cells 
in competitive binding assays. Such cells, either in viable or fixed form, can be used for standard 
binding assays. One may measure, for example, the formation of complexes between SRT 

10 polypeptide or a fragment and the agent being tested. Alternatively, one can examine the 
diminution in complex formation between the SRT polypeptide and its target cell or target 
receptors caused by the agent being tested. 

Thus, the present invention provides methods of screening for drugs or any other agents 
which can affect a SRT polypeptide-associated disease or disorder. These methods comprise 

1 5 contacting such an agent with an SRT polypeptide or fragment thereof and assaying (I) for the 
presence of a complex between the agent and the SRT polypeptide or fragment, or (ii) for the 
presence of a complex between the SRT polypeptide or fragment and the cell, by methods well 
known in the art. In such competitive binding assays, the SRT polypeptide or fragment is 
typically labeled. After suitable incubation, free SRT polypeptide or fragment is separated from 

20 that present in bound form, and the amount of free or uncomplcxed label is a measure of the 
ability of the particular agent to bind to SRT polypeptide or to interfere with the SRT 
polypeptide/cell complex. 

Another technique for drug screening provides high throughput screening for compounds 
having suitable binding affinity to a polypeptide and is described in detail in WO 84/03564, 

25 published on September 13, 1984. Briefly stated, large numbers of different small peptide test 
compounds arc synthesized on a solid substrate, such as plastic pins or some other surface. As 
applied to a SRT polypeptide, the peptide test compounds are reacted with SRT polypeptide and 
washed. Bound SRT polypeptide is detected by methods well known in the art. Purified SRT 
polypeptide can also be coated directly onto plates for use in the aforementioned drug screening 

30 techniques. In addition, non-neutralizing antibodies can be used to capture the peptide and 
immobilize it on the solid support. 
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This invention also contemplates the use of competitive drug screening assays in which 
neutralizing antibodies capable of binding SRT polypeptide specifically compete with a test 
compound for binding to SRT polypeptide or fragments thereof. In this manner, the antibodies 
can be used to detect the presence of any peptide which shares one or more antigenic 
determinants with SRT polypeptide. 

5 

EXAMPLE 1 I 
Rational drug design 

The goal of rational drug design is to produce structural analogs of biologically active 
polypeptide of interest (i.e., a SRT polypeptide) or of small molecules with which they interact, 

10 e.g., agonists, antagonists, or inhibitors. Any of these examples can be used to fashion drugs 
which arc more active or stable forms of the SRT polypeptide or which enhance or interfere with 
the function of the SRT polypeptide in vivo (<•./, Hodgson, Bio/Technology , 9: 19-21 (1991)). 

In one approach, the three-dimensional structure of the SRT polypeptide, or of an SRT 
polypeptide-inhibitor complex, is determined by x-ray crystallography, by computer modeling 

1 5 or, most typically, by a combination of the two approaches. Both the shape and charges of the 
SRT polypeptide must be ascertained to elucidate the structure and to determine active site(s) of 
the molecule. Less often, useful information regarding the structure of the SRT polypeptide may 
be gained by modeling based on the structure of homologous proteins. In both cases, relevant 
structural information is used to design analogous SRT polypeptide-like molecules or to identify 

20 efficient inhibitors. Useful examples of rational drug design may include molecules which have 
improved activity or stability as shown by Braxton and Wells, Biochemistry, 31:7796-7801 
( 1992) or which act as inhibitors, agonists, or antagonists of native peptides as shown by Athauda 
et al, J. Biochem. . 113:742-746 (1993). 

It is also possible to isolate a target-specific antibody, selected by functional assay, as 

25 described above, and then to solve its crystal structure. This approach, in principle, yields a 
pharmacore upon which subsequent drug design can be based. It is possible to bypass protein 
crystallography altogether by generating anti-idiolypic antibodies (anti-ids) to a functional, 
pharmacologically active antibody. As a mirror image of a mirror image, the binding site of the 
anti-ids would be expected to be an analog of the original receptor. The anti-id could then be 

30 used to identify and isolate peptides from banks of chemically or biologically produced peptides. 
The isolated peptides would then act as the pharmacore. 
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By virtue of the present invention, sufficient amounts of the SRT polypeptide may he 
made available to perform such analytical studies as X-ray crystallography. In addition, 
knowledge of the SRT polypeptide amino acid sequence provided herein will provide guidance 
to those employing computer modeling techniques in place of or in addition to x-ray 
crystallography. 

5 

The foregoing written specification is considered to be sufficient to enable one skilled in 
the art to practice the invention. The present invention is not to be limited in scope by the 
construct deposited, since the deposited embodiment is intended as a single illustration of certain 
aspects of the invention and any constructs that are functionally equivalent are within the scope 

1 0 of this invention. The deposit of material herein does not constitute an admission that the written 
description herein contained is inadequate to enable the practice of any aspect of the invention, 
including the best mode thereof, nor is it to be construed as limiting the scope of the claims to 
the specific illustrations that it represents. Indeed, various modifications of the invention in 
addition to those shown and described herein will become apparent to those skilled in the art 

15 from the foregoing description and fall within the scope of the appended claims. 
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